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APPLICANTS' BRIEF ON APPEAL TO THE 
BOARD OF PATENT APPEALS AND INTERFERENCES 



Sir: 

This Appeal Brief is submitted in response to the Notice of Non-Compliance with 
37 C.F.R. § 1.192(c) mailed from the United States Patent Office on March 11, 2003. 

This Brief in support of an appeal to the Board of Patent Appeals and 
Interferences from the final rejection of claims 3-6, 11, and 13-16 in the above-referenced 
application is submitted in triplicate as required by 37 C.F.R. 1.192. Also enclosed are 
Appendices A-D which are referenced and relied upon herein. 
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polypeptides are capable of binding to the HLA-DR protein to form a complex which 
activates T cells which become autoreactive and initiate the autoimmune disease. The 
preparations of the present invention specifically do not include such polypeptides but, 
rather, include other antigens from the pathogen." 

The specification states, at page 8, lines 14-24: "In particular embodiments, such 
pharmaceutical preparations are provided in which the HLA-DR protein is HLA-DR4 
protein and the autoimmune disease is pemphigus vulgaris. In addition, a particular 
sequence motif is provided for pemphigus vulgaris and pharmaceuticals which lack 
peptides having this motif are provided. Specific embodiments of the pharmaceuticals 
include preparations lacking each of the polypeptides described above with respect to 
pemphigus vulgaris." 

Although those teachings are summarized above, the Board is strongly urged to 
study the specification before considering the rejections on appeal. A copy of the 
specification is attached as Appendix B. A copy of the parent application, U.S. Serial No. 
08/400,796, which issued as U.S. Patent No. 5,874,53 1 on February 23, 1999, is attached 
as Appendix C. 

ISSUES 

1 . The first issue presented for appeal is whether claims 3-6 and 1 1 , which 
directly or indirectly recite a "human polypeptide consisting essentially of an amino acid 
sequence corresponding to the core MHC binding residues of a sequence motif for an 
HLA-DR protein" are unpatentable under 35 U.S.C. § 1 12, 1 st paragraph as being based 
on a nonenabling disclosure. 

2. The second issue presented for appeal is whether claim 6, which recites 
"an amino acid sequence consisting essentially of an amino acid sequence selected from 
the group consisting of SEQ ID NO.: 1, SEQ ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, 
SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ ID NO.: 7" is unpatentable under 35 U.S.C. § 
112, 1 st paragraph as being based on a nonenabling disclosure. 
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3. The third issue presented for appeal is whether claims 13-16, which 
directly or indirectly recite a "human pathogen that in its native form includes a 
polypeptide having an amino acid sequence corresponding to the core MHC residues of a 
sequence motif for an HLA-DR protein" is unpatentable under 35 U.S.C. § 1 12, 1 st 
paragraph as being based on a nonenabling disclosure. 

4. The fourth issue presented for appeal is whether claim 16, which recites 
"an amino acid sequence consisting essentially of an amino acid sequence selected from 
the group consisting of SEQ ID NO.: 1, SEQ ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, 
SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ ID NO.: 7" is unpatentable under 35 U.S.C. § 

1 12, 1 st paragraph as being based on a nonenabling disclosure. 

5. The fifth issue presented for appeal is whether claims 13-16 are 
unpatentable under 35 U.S.C. § 1 12, 2 nd paragraph as being indefinite. 

6. The sixth issue presented for appeal is whether the double patenting 
rejection of claims 3-6 and 13-16 over claim 3 of U.S. Patent No. 5,874,531 is proper. 

7. The seventh issue presented for appeal is whether appealed claims 3-5 are 
novel in view of Amagai et al. (Autoantibodies against a Novel Epithelial Cadherin in 
Pemphigus Vulgaris, a Disease of Cell Adhesion, Cell Vol., 67, 869-877 (1991), 
hereinafter, "Amagai"). 

8. The eighth issue presented for appeal is whether appealed claims 13-15 are 
novel in view of Amagai. 

9. Although Applicant believes that the above-identified issues correspond to 
all of the grounds for the pending rejections, Applicants also appeal any other bases for 
rejection of the pending claims which were not explicitly stated in the final Office action, 
but which may be regarded as still pending. 
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GROUPING OF CLAIMS 

With regard to Issue 1, if rejected claim 3 is patentable under 35U.S.C. § 112, 1 st 
paragraph, then claims 3-6 and 1 1 are all patentable under 35 U.S.C. § 1 12, 1 st paragraph. 
Otherwise, claims 6 and 1 1 stand or fall separately from claims 3-5 for the reasons set 
forth in the argument for Issue 2. 

With regard to Issue 3, if rejected claim 13 is patentable under 35 U.S.C. § 1 12, 
1 st paragraph, then claims 13-16 are all patentable under 35 U.S.C. § 1 12, 1 st paragraph. 
Otherwise, claim 16 stands or falls on its own for the reasons set forth in the argument for 
Issue 4. 

With regard to Issue 5, rejected claims 13-16 stand or fall together. 
With regard to Issue 6, rejected claims 3-6 and 13-16 stand or fall together. 
With regard to Issue 7, rejected claims 3-5 stand or fall together. 
With regard to Issue 8, rejected claims 13-15 stand or fall together. 

ARGUMENT 

Pursuant to 37 C.F.R. §1.192(c)(8)(iv), the following sections describe antigen 
presentation, discuss the legal standard applicable to the instant application, indicate the 
specific limitations in the rejected claims that are not disclosed in the applied references, 
and explain how such limitations render the claimed subject matter patentable. 

Autoimmune disease is generally defined as the pathological consequences, 
including tissue injury, produced by autoantibodies or autoreactive T cells interacting 
with self epitopes. An "epitope" is defined as "an antigenic determinant ... the simplest 
form or smallest structural area on a complex antigen molecule that can combine with an 
antibody or T lymphocyte receptor." (The Illustrated Dictionary of Immunology, CRC 
Press (1995)) Self epitopes are derived from an individual's own proteins. The 
inappropriate response of the immune system against self epitpopes in autoimmune 
disease can cause serious damage to cells and organs, sometimes with fatal consequences. 
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The distinction between an antigen and an epitope is critical to the understanding 
of the present invention. The terms "autoantigen," and "self antigen," as understood in 
the art and used in the instant specification, refer to a full-length protein or a polypeptide 
that includes a self epitope which, as stated above, is the simplest form or smallest 
structural area on a complex antigen molecule that can combine with an antibody or T 
lymphocyte receptor. 

T cell recognition of antigens, including self antigens, involves a tri-molecular 
complex of (1) a T cell receptor, (2) an MHC molecule, such as HLA-DR, and (3) a short 
peptide comprising an epitope. (See for example, page 6, lines 27-30, original claims 3 
and 13, and pending claims 3 and 13 which require the that the polypeptides of the 
invention be "capable of binding to HLA-DR protein to form a complex which activates 
autoreactive T cells in subject having the autoimmune disease.") When the MHC 
molecule is a Class II molecule, such as HLA-DR, the peptides derived from antigen are 
heterogeneous in size. (See page 5, lines 8-10.) Peptides that bind HLA-DR molecules 
have been reported as ranging in size from 12-25 amino acid long. (See Chicz et al., 
"Predominantly naturally processed peptides bound to HLA-DR1 are derived from MHC- 
related molecules and are heterogeneous in size," Nature , 358:764-768 (1992); cited on 
page 5, at line 10 of the original specification.) 

Particular MHC class II molecules (e.g., HLA-DR) have been closely associated 
with autoimmune disease. For example, HLA-DR4 has been associated with the 
autoimmune condition pemphigus vulgaris (PV). PV is an autoimmune disease of the 
skin which is manifested by blistering lesions of the skin and mucous membranes. (See 
page 2, lines 13-17.) 

The present specification teaches methods of identifying peptides that form part of 
the tri-molecular complex with autoreactive T cells and HLA-DR molecules. Such 
peptides are useful to treat autoimmune diseases, for example, by inducing high dose 
tolerance to thereby render autoantigenic T cells unresponsive to self epitope bound to an 
HLA-DR molecule. (See page 30, lines 14-31 and page 52, lines 11-15.) 
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A large body of clinical and epidemiological evidence suggests that bacterial or 
viral infection may trigger the induction of autoimmunity. (See page 3, lines 13-17.) One 
theory suggests that peptides of a pathogen that closely resembling self peptides "mimic" 
self epitopes to activate autoreactive T cells and induce autoimmunity. Accordingly, the 
present invention also provides for methods of vaccinating an individual against an 
autoimmune condition by vaccinating the individual against the pathogen with a 
preparation that specifically excludes autoreative peptides from the pathogen, that is, 
peptides that form part of the complex with autoreactive T cells and HLA-DR molecules. 
(See page 32, lines 9-29.) 

Issue 1 - Rejection of Claims 3-6 and 1 1 Under 35 U.S. C. § 1 12, First Paragraph 

Claims 3-6 and 1 1 were rejected under 35 U.S.C. § 1 12, first paragraph, as 
allegedly failing to meet the enablement requirement. In particular, the Office action 
suggested that the specification does not enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and/or use the invention. 

The Office action states that "the specification, while being enabling for a 
pharmaceutical preparation comprising a human polypeptide consisting of one of SEQ 
IDS NOS: 1-7, does not reasonably comprise a human polypeptide consistin g^essentiaUv 
of an amino acid sequence corresponding to the core MHC binding residues of a 
sequence mottf foraiLHLA^ nor consisting essentially >pf ~pne of SEQ 

NOS: 1-7." (Emphasis added.) 



Applicants ' Rebuttal of the Rejection of Claims 3-6 and 11 under 35 U.S.C. § 112, First 
Paragraph 

According to MPEP § 21 1 1.03: "the transitional phrase 'consisting essentially of 
limits the scope of a claim to the specified materials or steps 'and those that do not 
materially affect the basic and novel characteristic(s)' of the claimed invention," MPEP § 
21 11.03, quoting In re Herz , 537 F.2d 549, 551-52, 190 USPQ 461, 463 (C.C.P.A. 1976) 
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(emphasis in original). "A 'consisting essentially of claim occupies a middle ground 
between closed claims that are written in a 'consisting of format and fully open claims 
that are drafted in a 'comprising' format." PPG Industries v. Guardian Industries . 156 
F.3d 1351, 1354, 48 USPQ2d 1351, 1353-54 (Fed. Cir. 1998). The phrase "consisting 
essentially of "exclude[s] additional unspecified ingredients which would affect the 
basic and novel characteristics of the product defined in the balance of the claim." In re 
Garnero, 412 F.2d 276, 279, 162 USPQ 221, 223 (C.C.P.A. 1969). 

As explained below, Applicants have clearly identified the "basic and novel 
characteristics" of the present invention and therefore the "consisting essentially of 
phrase should be construed as open to those materials that do not materially affect the 
basic and novel characteristics and closed to those materials that do materially affect the 
basic and novel characteristics of the claimed invention. 

In brief, the present application teaches methods of "defining those amino acids of 
the self or non-self antigen that are needed for MHC binding and TCR [T cell receptor] 
contact" so that "self epitopes involved in autoimmune disease may be identified." (See 
page 5, lines 24-3 1 .) It is these self epitopes (and non-self epitopes that mimic self 
epitopes) that are the active agents of the pharmaceutical preparations for tolerization 
recited in claims 3-6 and 1 1 . Accordingly^the basic and novel characteris tics o f t he 
claimed polypeptides include the ability (1) bind to HLA-I^]£otei^ 
autoreactive T cells from a subject having an autoimmune disease. 

Applicants submit that independent claim 3 (as well as claims 4-6 and 1 1 that 
depend from claim 3) which claims polypeptides that "consist essentially of an amino 
acid sequence corresponding to the core MHC binding residues of the sequence motif for 
an HLA-DR molecule . . . [which] binds to said HLA-DR protein [and] activates 
autoreactive T cells from a subject having said autoimmune disease" embraces only 
polypeptides that bind to HLA-DR and activate autoreactive T cells, and specifically 
excludes peptides that are incapable of binding to HLA-DR and/or activating autoreactive 
T cells. 
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Furthermore, the specification expressly states that "peptides including at least the 
MHC binding and TCR contact residues are contemplated as equivalents." (See page 28, 
lines 27-30.) Thus, the present specification clearly indicates that the basic and novel 
characteristics of the claimed polypeptides are the ability to (1) bind to HLA-DR protein 
and (2) activate autoreactive T cells from a subject having an autoimmune disease. 
Furthermore, the claims explicitly require that the preparations of the invention comprise 
polypeptide that "binds to [said] HAL-DR protein" and "activates autoreactive T cells." 

For the foregoing reasons, Applicants submit that the instant specification fully 
enables pending claims 3-6 and 1 1 when the term "consisting essentially of is interpreted 
properly as partially open and partially closed. Under this construction of the "consisting 
essentially of phrase, claims 3-6 and 1 1 include polypeptides with an amino acid 
sequence corresponding to the core MHC binding residues of a sequence motif for HLA- 
DR protein and other components (e.g., additional amino acids) that do not materially 
affect the ability of the polypeptides to bind to HLA-DR molecules and activate 
autoreactive T cells from a subject having an autoimmune disease. 

Full-length proteins or long polypeptides do not fall within the scope of claims 3- 
6 and 1 1 because, as was notoriously well known in the art at the time the application was 
filed, MHC molecules do not bind full-length proteins or long polypeptides from an " x ^ 
antigen. MHC molecules bind short peptides that are small fragments of an antigen. The 1^3 ^ 
distinction between an antigen and an epitope is critical to the understanding of the ^ 
present invention. The terms "autoantigen," and "self antigen," as understood in the art 
and used in the instant specification, refers to full-length protein or polypeptide that 
includes a self epitope. As stated above, an "epitope" is an antigenic determinant . . . the 
simplest form or smallest structural area on a complex antigen molecule that can combine 
with an antibody or T lymphocyte receptor. 

The present application is directed to identification of self epitopes and 
preparations that utilize such self epitopes. The present specification explains that 
although the target antigens implicated in the immunopathogenesis of disease have been 
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identified, specific self epitopes have not been identified. See, for example, page 23, 
lines 15-20, which reads: "An ever increasing number of autoimmune diseases are now 
being associated with particular alleles of the MHC class II HLA-DR locus. For most of 
these autoimmune diseases, the self epitope remains unknown . For some, however, the 
self protein involved in autoimmune response is known or suspected. In one aspect of the 
present invention, a method is provided for identifying the self epitopes involved in 
autoimmune diseases associated with HLA-DR alleles ." (Emphasis added.) The 
pharmaceutical preparations of claims 3-6 and the method of claim 1 1 utilize these self 
epitopes to tolerize (i.e., render autoreactive T cells unresponsive to the self epitope, see 
page 52, lines 12-17) an individual to an autoantigen through MHC class II presentation. 

The specification reiterates the distinction between an autoantigen and self epitope 
in the description of the autoimmune disease pemphigus vulgaris at page 37, lines 16-23 
"Although the autoantigen for pemphigus vulgaris is known, the precise epitopes within 
the autoantigen have previously remained unknown. Using the methods of the present 
invention, it has been possible to identify a small set of peptides that may serve as the 
autoantigenic determinants. The target antigen of pemphigus vulgaris is an epithelial 
adhesion molecule of the cadherin family, desmolgein 3 (Amagai et al., 1991)." As stated 
in the Abstract, "The peptides relating to pemphigus vulgaris are self epitopes," thus, the 
polypeptides recited in claims 3-6 and 1 1 are essentially self epitopes that form part of the 
HLA-DR/T cell receptor/peptide complex that causes autoimmune disease and they are 
necessarily relatively small peptides. 

Furthermore, the working examples of the polypeptides of the invention that are 
disclosed in the instant specification (SEQ ID NO. 1-7) are peptides that are 15 amino 
acids long, "partly as a result of the computer database search program used (Genetics 
Computer Group program "Findpatterns") but also corresponding to the size of the cleft 



in MHC class II molecules" (See page 37, lines 10-14; emphasis added.) Thus, the 
instant specification teaches that polypeptides capable of binding to HLA-DR proteins 
may be approximately 1 5 amino acids long. However, the instant specification also 
teaches that shorter or longer peptides "may have utility" and thus, "fall within the spirit 
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and the scope of the claims" (See page 28, lines 16-30.) Thus, pending claim 3 which 
recites a pharmaceutical preparation for tolerization . . . consisting essentially of an amino 
acid sequence corresponding to the core MHC binding residues of a sequence motif for 
an HLA DR protein, . . . [which] binds to said HLA-DR protein [and] activates 
autoreactive T cells" embraces polypeptides that consist of self epitopes (which may be 
heterogeneous in size) and excludes full length proteins and large protein fragments that 
are too large to form the HLA-DR/T cell receptor/peptide complex. 

The Office action suggests that "there is no guidance in the specification as to 
what alterations result in a functional polypeptide, i.e., one that binds HLA-DR." 
(Emphasis added.) Applicants submit that the present specification provides detailed 
analysis of the binding pockets of HLA-DR molecules which properly teaches the skilled 
artisan how to predict which peptides are capable of binding to HLA-DR molecules. For 
example, the claims 5 and 15 provide a motif to identify peptides that are capable binding 
to HLA-DR molecules and activating T cells associated with the autoimmune disease 
pemphigus vulgaris. 

The Office action also suggests that undue experimentation would be required to 
determine what substitutions would be acceptable to retain functional activity. 
Applicants note that "The test of enablement is not whether any experimentation is 
necessary, but whether, if experimentation is necessary, it is undue ." In re Angstadt 537 
F.2d 498, 504, 190 USPQ 214, 219 (CCPA 1976). Moreover, Applicants submit that 
neither "extensive experimentation" nor "undue experimentation" is required to practice 
the claimed invention. 

The identification of amino acid sequences that have functional activity including 
the ability to bind to HLA-DR proteins and to activate autoreactive T cells may be readily 
determined by various assays that were well known to the skilled artisan at the time the 
present application was filed. For example, functional activity may determined using the 
T cell proliferation assay described in Example 1 of the specification at page 44, lines 11- 
22. 
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According to MPEP § 2164.01: "The fact that experimentation may be complex 
does not necessarily make it undue, if the art typically engages in such experimentation. 
In re Certain Limited-Charge Cell Culture Microcarriers . 221 USPQ 1165, 1174 (Infl 
Trade Comm'n 1983), aff d. sub nom. Massachusetts Institute of Technology v. A.B. 
Fortia, 774 F.2d 1 104, 227 USPQ 428 (Fed. Cir. 1985)." MPEP § 2164.01. The 
specification teaches, at page 25, lines 3-13, that the peptides of the invention may be 
screened for activity using experimentation that is typical in the art, for example, by "in 
vitro tests for the ability to induce the proliferation of autoreactive T cells or to induce the 
secretion of lymphokines (cytokines) from these T cells or to induce other effector 
functions such as cytotoxicity." Accordingly, Applicants submit that undue 
experimentation is not required to practice the claimed invention. 

The Office action cites the Ngo reference ("The Protein Folding Problem and 
Tertiary Structure Prediction," Merz & LeGrans, Birkhauser, Boston, pages 491-495, 
1994) to support the assertion that the relationship between the sequence of a polypeptide 
and the tertiary structure is not predictable. As is well known in the art, proteins are 
cleaved into fragments of approximately 12-25 amino acids and bound in the binding 
cleft of the HLA-DR molecule in an unfolded or essentially linear conformation. See 
page 1 16 of Cellular and Molecular Immunology , attached at Appendix D, which states 
"T cells recognize only linear determinants of peptides defined predominantly by primary 
amino acid sequences that assume extended confirmations within the peptide-binding 
clefts of MHC molecules." Thus, the tertiary structure of the peptide does not materially 
impact the ability of a self epitope polypeptide to bind to HLA-DR molecules and activate 
autoreactive T cells. Accordingly, Applicants submit that the difficulty in predicting the 
folding pattern or tertiary structure of full-length proteins discussed in Ngo is irrelevant to 
the ability of the peptides of the invention to bind to HLA-DR molecules and/or activate 
autoreactive T cells. 

Therefore, in light of the foregoing reasons, Applicants respectfully request that 
the rejections of claims 3-6 and 1 1 under 35 U.S.C. § 1 12, first paragraph, be 
reconsidered and withdrawn. 
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Issue 2 - Rejection of Claims 6 and 1 1 Under 35 U.S.C. § 112, First Paragraph 

Claims 3-6 and 1 1 were rejected under 35 U.S.C. § 112, first paragraph, as 
allegedly failing to meet the enablement requirement. In particular, the Office action 
suggested that the specification does not enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and/or use the invention. As 
discussed above, Applicants have respectfully requested that the rejections of claims 3-6 
and 1 1 under 35 U.S.C. § 1 12, first paragraph, be reconsidered and withdrawn. For the 
following reasons. Applicants submit that even if the rejection of claims 3-5 under 35 
U.S.C. § 1 12, first paragraph, is not withdrawn at this time, the rejection of claims 6 and 
1 1 should be reconsidered and withdrawn. 

Applicants 9 Further Rebuttal of the Rejection of Claims 6 and 11 under 35 U.S.C. § 112, 
First Paragraph 

Claim 6 depends from claim 4, which depends in turn from claim 3. "A claim in 
dependent form shall be construed to incorporate by reference all the limitations of the 
claim to which it refers." 35 U.S.C. § 1 12, fourth paragraph (2003). Claim 4 requires an 
isolated human polypeptide that binds to HLA-DR4 protein and, bound to HLA-DR4 
protein, activates autoreactive T cells from a subject having pemphigus vulgaris. Claim 6 
requires that the polypeptide consist essentially of an amino acid sequence consisting 
essentially of an amino acid sequence selected from the group consisting of SEQ ID 
NOs.: 1-7. 

The transition "consisting essentially of excludes from the amino acid sequence 
of claim 6 additional moieties materially affecting the basic and novel characteristics of 
the product. As set forth within the claims themselves and in the specification, the basic 
and novel characteristics of the polypeptide of claim 6 include that the polypeptide binds 
to HLA-DR4 protein and that the polypeptide bound to HLA-DR4 protein activates 
autoreactive T cells from a subject having pemphigus vulgaris. Thus, the amino acid 
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sequence is closed to the presence of additional moieties materially affecting the ability of 
the polypeptide to bind to HLA-DR4 protein and to activate autoreactive T cells from a 
subject having pemphigus vulgaris. 

Claim 1 1 depends in the alternative from claims 4-6 and incorporates by reference 
all of the limitations of the particular claim in relation to which it is being considered. 
See 35 U.S.C. §112, fifth paragraph (2003). 

The outstanding rejection of claims 6 and 1 1 acknowledges that the specification 
enables pharmaceutical preparations "comprising a human polypeptide consisting of one 
of SEQ ID NOS: 1-7." Thus, the only question is whether the specification also enables 
pharmaceutical preparations comprising human polypeptides with one or more additional 
moieties that do not materially affect the ability of the polypeptide to bind to HLA-DR4 
protein and to activate autoreactive T cells from a subject having pemphigus vulgaris. 

Many of the arguments made in the Office action to support the rejections of 
claims 3-6, 1 1 and 13-16 are irrelevant to claim 6. To support the rejection of claims 3-6, 
1 1 and 13-16 under 35 U.S.C. § 1 12, the Office action asserts that "the amino acids at a 
maximum of three of the motif positions may not be motif amino acids and may actually 
be deleterious to binding" and that "extended experimentation . . . would be required to 
determine which substitutions would be acceptable." Neither of these assertions is 
relevant to claim 6. The polypeptide of claim 6 consists essentially of an amino acid 
sequence consisting essentially of one of SEQ ID NOs: 1-7. No substitutions within any 
of the seven SEQ ID NOs are permitted. 

The Office action also asserts that the presence of amino acid residues outside the 
"core" may be deleterious to binding. It is, however, uncontested that peptides 12-25 
amino acids in length can bind productively to HLA-DR4. See , e.g , page 28, lines 16-30 
of the specification (teaching that shorter or longer peptides "may have utility" and "fall 
within the spirit and the scope of the claims"). 

Furthermore, Applicants submit that any experimentation to confirm the ability of 
a human polypeptide consisting essentially of an amino acid sequence consisting 
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essentially of an amino acid sequence selected from the group consisting essentially of 
SEQ ID NOs: 1-7 would be routine and not undue. As noted in the specification, for 
example, functional activity may be determined using a T cell proliferation assay. See 
page 44, lines 1 1-22 of the specification. The specification teaches, at page 25, lines 3- 
13, that the peptides of the invention may be screened for activity using experimentation 
that is typical in the art, for example, by "in vitro tests for the ability to induce the 
proliferation of autoreactive T cells or to induce the secretion of lymphokines (cytokines) 
from these T cells or to induce other effector functions such as cytotoxicity." In view of 
the availability of routine methods to screen the peptides for activity, the relatively 
narrow scope of the claims and the express teachings in the specification regarding how 
to make and use the invention with the high level of skill in the art, Applicants 
respectfully request that the rejections of claims 6 and 1 1 under 35 U.S.C. § 1 12, first 
paragraph, be reconsidered and withdrawn. 

Issue 3 - Rejection of Claims 13-16 Under 35 U.S.C. $ 1 12, First Paragraph 

Claims 13-16 were rejected under 35 U.S.C. § 1 12, first paragraph, as allegedly 
failing to meet the enablement requirement. In particular, the Office action suggested that 
the specification does not enable any person skilled in the art to which it pertains, or with 
which it is most nearly connected to make and/or use the invention. 

Applicants note that the Office action also rejected claims 3-6 and 1 1 under 35 
U.S.C. § 1 12, first paragraph, as allegedly failing to meet the enablement requirement. 
Because the language of independent claim 13 differs from the language of independent 
claim 3, and for the reasons which follow. Applicants submit that claims 13-16 should 
stand or fall independently from claims 3-6 and 11. 

The Office action alleges that claim 13 is not enabled for preparations for 
immunizing an individual against a "human pathogen that in its native form includes a 
polypeptide having an amino acid sequence corresponding to the core MHC binding 
residues of a sequence motif for an HLA-DR protein." 
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Applicants f Rebuttal to the Rejection of Claims 13-16 under 35 U.S.C. § 112, First 
Paragraph 

The Office action alleges that claim 13 is not enabled for preparations for 
immunizing an individual against a "human pathogen that in its native form includes a 
polypeptide having an amino acid sequence corresponding to the core MHC binding 
residues of a sequence motif for an HLA-DR protein." 

"Transitional phrases such as "composed of," "having, 55 or "being 55 must be 
interpreted in light of the specification to determine whether open or closed claim 
language is intended. See, e.g., ReRents of the Univ. of Cal. v. Eli Lilly & Co. , 119 F.3d 
1559, 1573, 43 USPQ2d 1398, 1410 (Fed. Cir. 1997), cert, denied, 118 S. Ct. 1548 
(1998) (construing "having, 55 in the context of a cDNA having a sequence coding for 
human PI, to permit inclusion of other moieties). MPEP § 21 1 1.03. 

Applicants note that the objected to phrase recited in claim 13 sets forth the target 
of the immunization, a pathogen that in its native form includes polypeptides having 
amino acid sequence corresponding to the core MHC binding residues of a sequence 
motif; however, the final limitation of the claim requires that the pharmaceutical 
preparation be free of autoantigens (i.e., peptides with amino acid sequence 
corresponding to the polypeptide that binds to HLA-DR protein and activates autoreactive 
T cells from a subject having the autoimmune disease). Thus, the epitopes described in 
the specification and recited in clams 3-6 and 1 1 are removed from the preparations of 
claims 13-16 to vaccinate an individual against a pathogen associated with autoimmune 
disease without exposing the individual at risk to the autoimmune disease to autoantigens 
that are native to that pathogen. 

The specification teaches that preparation for vaccinating a person at risk of an 
autoimmune disease of the invention may be used to induce immunity against a pathogen 
associated with autoimmunity without exposing the individual to the autoantigens 
associated with the pathogen. Thus, the phrase "having 5 ' as used in claim 13 is an open 
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transitional phrase, which includes polypeptides native to the pathogen. However, the 
"free of the amino acid sequence" corresponding to the core MHC binding residues 
limitation specifically excludes epitopes that bind to HLA-DR and activate autoreactive T 
cells. 

Applicants submit, for the reasons stated above, that the instant specification 
properly teaches the skilled artisan how to identify peptides that bind to HLA-DR 
molecules associated with autoimmune disease and activate autoreactive T cells in a 
person having such an autoimmune disease. The instant specification further teaches the 
skilled artisan that an individual at risk of an autoimmune disease associated with a 
human pathogen can be immunized against the pathogen with a preparation that can 
include polypeptides of the pathogen but that certainly excludes (i.e., is free of) epitopes 
that activate autoreactive T cells in a person inflicted with the autoimmune disease. 

Therefore, in light of the foregoing reasons, Applicants respectfully request that 
the rejections of claims 13-16 under 35 U.S.C. § 1 12, first paragraph, be reconsidered and 
withdrawn. 

Issue 4 - Rejection of Claim 16 Under 35 U.S.C. § 1 12, First Paragraph 

Claims 13-16 were rejected under 35 U.S.C. § 1 12, first paragraph, as allegedly 
failing to meet the enablement requirement. In particular, the Office action suggested that 
the specification does not enable any person skilled in the art to which it pertains, or with 
which it is most nearly connected, to make and/or use the invention. As discussed above, 
Applicants have respectfully requested that the rejections of claims 13-16 under 35 
U.S.C. § 1 12, first paragraph, be reconsidered and withdrawn. For the following reasons. 
Applicants submit that even if the rejection of claims 13-15 under 35 U.S.C. § 1 12, first 
paragraph, is not withdrawn at this time, the rejection of claims 16 should be reconsidered 
and withdrawn. 
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Applicants 9 Further Rebuttal to the Rejection of Claim 16 under 35 U.S.C. §112, First 
Paragraph 

Claim 16 depends from claim 14, which in turn depends from claim 13. Thus, 
claim 16 relates to a pharmaceutical preparation comprising a pharmaceutically 
acceptable carrier and an amount of an immunogenic preparation effective to immunize 
against a human pathogen that in its native form includes a polypeptide that binds to 
HLA-DR4 protein and, bound to the HLA-DR4 protein, activates autoreactive T cells 
from a subject having pemphigus vulgaris. The immunogenic preparation is free of a 
polypeptide corresponding to a sequence consisting essentially of an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1-7. 

Applicants submit that it is straightforward, given a known amino acid sequence 
such as that of elected SEQ ID NO: 3, to omit that sequence from a protein preparation 
using biochemical or recombinant DNA technologies. If the sequence is absent, the 
immunogenic preparation will be free of a polypeptide corresponding to a sequence 
consisting essentially of SEQ ID NO: 3. Accordingly, Applicants submit that preparation 
of an appropriate pharmaceutical preparation is well within the high level of skill in the 
art. The instant specification further teaches the skilled artisan that an individual at risk 
of an autoimmune disease associated with a human pathogen can be immunized against 
the pathogen with a preparation that can include polypeptides of the pathogen but that 
certainly excludes (i.e., is free of) epitopes that activate autoreactive T cells in a person 
inflicted with the autoimmune disease. 

In view of the relatively narrow scope of the claim and the express teachings in 
the application coupled with the high level of skill in the art, Applicants respectfully 
request that the rejections of claims 13-16 under 35 U.S.C. § 1 12, first paragraph, be 
reconsidered and withdrawn. 



Issue 5 - Rejection of Claims 13-16 Under 35 U.S.C. § 1 12, Second Paragraph 
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Claims 13-16 were rejected under 35 U.S.C. § 1 12, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
Applicants regard as the invention. In particular, the Office action suggests that the 
recital of the phrase "wherein said preparation is free of a polypeptide corresponding to 
said sequence" is unclear. 

Applicants ' Rebuttal of the Rejection of Claims 13-16 under 35 U.S.C. § 112, 2 nd 
Paragraph 

Applicants submit that claim 13 clearly claims a pharmaceutical for vaccinating 
an individual at risk of an autoimmune disease by vaccinating against a human pathogen 
with a preparations which specifically excludes (is free of) autoantigenic peptides, that is, 
pathogenic polypeptides corresponding to a sequence motif for HLA-DR protein 
associated with an autoimmune disease. The text of claim 13 reads as follows: 

A pharmaceutical preparation for vaccinating an individual 
at risk of an autoimmune disease comprising a 
pharmaceutically acceptable carrier and 

an amount of an immunogenic preparation effective 
to immunize against a human pathogen that in its native 
form includes a polypeptide having an amino acid sequence 
corresponding to the core MHC binding residues of a 
sequence motif for an HLA-DR protein; 

wherein said sequence motif for said HLA-DR 
protein is based upon the structure of the HLA -DR binding 
site 

wherein said HLA-DR protein is associated with said 
autoimmune disease; 

wherein said polypeptide binds to said HLA-DR 
protein; 

wherein said polypeptide bound to said HLA-DR 
protein activates autoreactive T cells from a subject having 
said autoimmune disease; and 

wherein said preparation is free of a polypeptide 
corresponding to said sequence. (Emphasis added.) 
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The Office action suggests that the phrase "free of a polypeptide corresponding to 
said sequence" is indefinite. Applicants submit that this phrase is amenable to a single 
interpretation, specifically, the preparation is free of autoantigenic peptides, that is, 
peptides that bind to HLA-DR and activate autoreactive T cells are not present in the 
pharmaceutical preparation. 

The Office action further suggests that the recitation of "includes a polypeptide" is 
unclear because it is not clear whether said polypeptide is a portion of a protein from a 
pathogenic organism. This phrase, taken in the context of the limitation "immunize 
against a human pathogen that in its native form includes a polypeptide having an amino 
acid sequence corresponding to the core MHC binding residues of a sequence motif for 
an HLA-DR protein" clearly indicates that the polypeptide recited in claim 13 is a portion 
of a protein from the pathogenic organism. 

Applicants submit that claim 13 satisfies the requirements of 35 U.S.C. § 1 12, 
second paragraph, respectfully request that the rejection of claim 13 and claims 14-16 
(which depend from claim 13) under 35 U.S.C. § 1 12, second paragraph, be reconsidered 
and withdrawn. 

Issue 6 - Double Patenting 

Claims 3-6 and 13-16 were rejected under the judicially created doctrine of 
obviousness-type double patenting as being unpatentable over claim 3 of U.S. Patent No. 
5,874,531 ("the '531 patent"). The Office action states that "although the conflicting 
claims are not identical, they are patentably indistinct from each other because the 
composition comprising the peptides of claim 3 of the '531 patent are encompassed by 
the instant claims. Applicants note that the instant application is a division of the 
application that issued as the '531 patent and, as such, 35 U.S.C. § 121 prohibits the use 
of the 4 53 1 patent as a reference against the instant application. Thus, Applicants 
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respectfully request that the double patenting rejection over claim 3 of the '53 1 patent be 
reconsidered and withdrawn. 

Issue 7 - Rejections of Claims 3-5 Under 35 U.S.C. S 102(b) 

Claims 3-5 were rejected under 35 U.S.C. § 102(b) as being anticipated by 
Amagai. The Office action states that "in the absence in the specification of a definition 
of a 'human polypeptide consisting essentially of . . . \ the claim language is open, and 
inclusive of the full-length autoantigen." 

Applicants* Rebuttal to the Rejection of Claims 3-5 under 35 U.S.C. § 102(b) 

As stated above, with regard the § 112, first paragraph rejection, Applicants 
submit that the specification clearly teaches the basic and novel characteristics of the 
invention, and the transitional phrase is therefore partially open and partially closed. 

The disclosed invention provides a binding motif to determine which residues of a 
putative antigenic protein are capable of binding autoimmune associated HLA-DR 
proteins and activating autoreactive T cells. As stated above, independent claims 3 and 
13 are intended to embrace polypeptides capable of binding autoimmune associated HLA 
proteins, not a full-length autoantigenic protein as described in Amagai. Thus, 
Applicants submit that Amagai fails to teach or even suggest the invention recited in 
claims^ -5. 

Although Amagai teaches that desmoglein 3 is the full length autoantigen for 
pemphigus vulgaris, Amagai fails to teach or even suggest what short peptides make up 
the self epitopes for pemphigus vulgaris. Applicants submit that inclusion of the full- 
length 103 kD pemphigus vulgaris antigen protein disclosed by Amagai in the 
preparations of claim 3 would materially affect the basic and novel characteristics of the 
preparation and, therefore, the full-length pemphigus vulgaris protein is excluded from 
the scope of independent claim 3. Applicants further submit that one of ordinary skill in 
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the art would readily recognize that a preparation for tolerization consisting essentially of 
an isolated peptide capable of binding to HLA-DR and activating autoreactive T cells is 
superior to a preparation containing a full-length protein. 

In conclusion, Applicants submit that independent claim 3 is not anticipated by 
Amagai. Applicants further submit that Amagai does not anticipate claims 4 and 5, 
which depend from claim 3. In light of the foregoing, Applicants respectfully request that 
the 35 U.S. C § 102(b) rejection of claims 3-5 be reconsidered and withdrawn. 

Issue 8 - Rejections of Claims 13-15 Under 35 U.S.C. § 102(b) 

Claims 13-15 were rejected under 35 U.S.C. § 102(b) as being anticipated by 
Amagai. The Office action states that "in the absence in the specification of a definition 
of a 'human polypeptide consisting essentially of the claim language is open, and 
inclusive of the full-length autoantigen." 

Applicants note that the Office action also rejected claims 3-5 under 35 U.S.C. § 
102(b) as allegedly anticipated by Amagai. Because the language of independent claim 
13 differs from the language of independent claim 3, and for the reasons which follow. 
Applicants submit that claims 13-16 should stand or fall independently from claims 3-6 
and 11. 

Applicants ' Rebuttal of the Rejection of Claims 13-15 Under 35 U.S.C § 102(b) 

Applicants submit that Amagai fails to anticipate pending claims 13-15 for the 
following reasons. Independent claim 13 recites, in part, a vaccination preparation "that 
in its native preparation includes a polypeptide having an amino acid sequence 
corresponding to a sequence motif for an HLA-DR protein ... wherein said preparation is 
free of a polypeptide corresponding to said sequence" amino acid sequence corresponding 
to a sequence motif for an HLA-DR protein." As stated above, Amagai fails to teach 
what portions or segments of the desmoglein 3 protein contain the autoantigen for 
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pemphigus vulgaris. Furthermore, Amagai fails to teach a vaccination preparation which 
includes antigenic polypeptides of a pathogen and excludes polypeptides that activated 
autoreactive T cells from a subject having an autoimmune disease. Thus, Amagai does 
not anticipate claims 13-15. 

In conclusion, Applicants submit that independent claim 13 is not anticipated by 
Amagai. Furthermore, Amagai fails to anticipate claims 14-15, which depend from claim 
13. 

9. The Claimed Invention Is Not Unpatentable Under Any Other Possible Bases 

Applicants believe that the foregoing arguments address each of the pending 
rejections of the pending claims. In particular, the present Brief addresses each of the 
rejections made in the final Office action. However, if the Examiner regards any of other 
rejections as currently pending, Applicant requests that any and all such rejections be 
raised in the Examiner's Answer so that Applicant has an opportunity to respond. 
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CONCLUSION 



For the reasons given above, it is respectfully urged that the final rejections be 
reversed and the application be passed to issue with claims 3-6, 11, and 13-16. 

Applicants believe that no petition or fee is required at this time. If any petition or 
fee is required, please consider this a conditional petition therefor and authorization to 
charge Deposit Account No. 20-053 1 . 



Respectfully submitted, 



Date: April 11,2003 
Reg. No. 48,645 
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3. A pharmaceutical preparation for tolerization comprising a pharmaceutically acceptable 
carrier and 

an amount of an isolated human polypeptide effective for tolerizing an individual to an 
autoantigen, said human polypeptide consisting essentially of an amino acid sequence 
corresponding to the core MHC binding residues of a sequence motif for an HLA-DR protein; 

wherein said sequence motif for said HLA-DR protein is based upon the structure of the 
HLA-DR binding site; 

wherein said HLA-DR protein is associated with a human autoimmune disease; 

wherein said polypeptide binds to said HLA-DR protein; 

wherein said polypeptide bound to said HLA-DR protein activates autoreactive T cells 
from a subject having said autoimmune disease; and 

wherein said polypeptide is a non-collagen and non-myelin basic protein polypeptide. 

4. The pharmaceutical preparation of claim 3 wherein said HLA-DR protein is an 
HLA-DR4 protein and said autoimmune disease is pemphigus vulgaris. 

5. The pharmaceutical preparation of claim 4 wherein said motif is PV motif #1 . 

6. The pharmaceutical preparation of claim 4 wherein said amino acid sequence consists 
essentially of an amino acid sequence selected from the group consisting of SEQ ID NO.: 1, SEQ 
ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ ID NO.: 
7. 

11. A method of tolerizing an individual to an autoantigen of pemphigus vulgaris 
comprising 

administering an effective amount of the pharmaceutical preparation of any one of 
claims 4-6 to a subject in need of such treatment. 
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13. A pharmaceutical preparation for vaccinating an individual at risk of an autoimmune 
disease comprising a pharmaceutically acceptable carrier and 

an amount of an immunogenic preparation effective to immunize against a human 
pathogen that in its native form includes a polypeptide having an amino acid sequence 
corresponding to the core MHC binding residues of a sequence motif for an HLA-DR protein; 

wherein said sequence motif for said HLA-DR protein is based upon the structure of the 
HLA -DR binding site; 

wherein said HLA-DR protein is associated with said autoimmune disease; 
wherein said polypeptide binds to said HLA-DR protein; 

wherein said polypeptide bound to said HLA-DR protein activates autoreactive T cells 
from a subject having said autoimmune disease; and 

wherein said preparation is free of a polypeptide corresponding to said sequence. 

14. The pharmaceutical preparation of claim 13 wherein said HLA-DR protein is an 
HLA-DR4 protein and said autoimmune disease is pemphigus vulgaris. 

1 5 . The pharmaceutical preparation of claim 1 4 wherein said motif is P V motif # 1 . 

16. The pharmaceutical preparation of claim 14 wherein said amino acid sequence consists 
essentially of an amino acid sequence selected from the group consisting of SEQ ID NO.: 1, SEQ 
ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ JD NO.: 
7. 
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IDENTIFICATION OF SELF AND NON-SELF ANTIGENS 

IMPLICATED IN AUTOIMMUNE DISEASE - — 

Field of the Invention 
The present invention relates to the field of immunology 
and, in particular, to the identification of self and non-self 
antigens implicated in human autoimmune responses. The 
invention relates to methods of identifying such self and 
non-self antigens and provides examples of such antigens 
relating to multiple sclerosis and pemphigus vulgaris. The 
invention also relates to the use of such antigens for -in 
vitro assays, animal models, therapeutic agents and vaccines. 

n^rjc_grn:inri of the Invention 
Human autoimmune diseases have a striking genetic 
association with particular alleles of major 

histocompatability complex ( "MHC" ) class I or class II genes. 
The field was established by the seminal discovery of HLA-B27 
linked susceptibility to ankylosing spondylitis, a chronic 
inflammatory joint disease (Brewerton et al., 1973; 
Schlosstein et al., 1973). MHC associated susceptibility has 
now been documented for a variety of human autoimmune 
diseases, including insulin dependent diabetes mellitus. 
(IDDM), rheumatoid arthritis <RA>, pemphigus vulgaris (PV), 
multiple sclerosis (MS) and myasthenia gravis (MG) , just to 
name a few (Todd et al . , 1987; Ahmed et al., 1990; Ahmed et 
al. 1991; Lanchbury & Panayi, 1991; Spielman & Nathenson. 
1982; Protti et al., 1993). 

The MHC locus most commonly associated with autoimmune 
disease is the HLA-DRB locus (also known as DRB1), a highly 
polymorphic locus with over fifty known alleles. For example, 
a large body of epidemiological work has documented the 
association of rheumatoid arthritis with the DR4 <DRBl*0401. 
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DRB1*0404 ) and DRl (DRB1*0101) alleles, with the DR4 alleJ.es. 
conferring a higher risk than DRl (Lanchbury & Panayi, 1991). 
The risk is dramatically increased when the subject is 
homozygous or heterozygous for DRB1*0401 and/or DRB1*0404. 
The observation that arthritis is associated with three DR 
alleles that are structurally similar led to the development 
of the 'shared epitope' hypothesis as DRB1*0401, 0404 and 0101 
share critical polymorphic residues in the DRB 67-71 cluster 
(Gregersen et al . 1987; Lanchbury & Panayi, 1991). These 
residues (in particular DRB 71) appear to be critical in 
defining the selectivity of peptide binding to the disease 
associated molecules. 

Pemphigus vulgaris is an autoimmune disease of the skin in 
which high titer autoantibody production to an epidermal cell 
adhesion molecule (desmoglein 3) results in a loss of 
keratinocyte adhesion (acantholysis ) and severe blister 
formation (Amagai et al . , 1991). In different ethnic groups 
the disease is associated either with a DR4 allele (DRB1*0402) 
or with a rare DQl allele (DQB1*05032) ; only a small fraction 
of PV patients have neither susceptibility gene (Ahmed et al . , 
1991; Ahmed et al., 1990; Scharf et al . , 1988b). The DR4 
subtype associated with pemphigus differs only at three - 
positions in the DRB 67-71 cluster from the DR4 subtype 
associated with RA . The PV associated molecule has a negative 
charge (Glu) at the critical position (DRB 71); the 
neighboring position (DRB 70) is also negatively charged. The 
DR4 subtype associated with PV is the only one that carries a 
negative charge at DRB 71; a positive charge (Arg) is found at 
DRB 71 in the RA associated DR4 molecules. 

With respect to multiple sclerosis, recent immunological 
studies suggest that myelin basic protein (MBP) may be one of 
the important target antigens in the immunopathogenesis of the 
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disease. Several studies have demonstrated that MBP specific 
T cells are clonally expanded in MS patients and in an in vivo 
activated state (Allegretta et al., 1990; Wucherpf ennig et 
al., 1994b; Zhang et al . , 1994). Reactivity with the 
immunodominant MBP( 84-102) peptide is found predominantly in 
subjects carrying HLA-DR2 (the most common subtype of which is 
DRB1*1501), a genetic marker for susceptibility to MS. The 
MBP(84-102) epitope can also be presented by other MHC class 
II antigens, including HLA-DQ1 (Ota et al . , 1990; Martin. et 
al., 1990; Pette et al., 1990; Wucherpf ennig et al., 1934a). 
In vivo , the T cell response to this peptide appears to be 
dominated by a few expanded clones. 

While associations between MHC alleles and disease states 
have implicated autoimmunity in the aetiology of these 
diseases, a large body of clinical and epidemiological 
evidence suggests that infections may be important in the 
induction of autoimmunity. For example, particular viral 
infections frequently precede autoimmune myocarditis and type 
I diabetes (IDDM) (Rose et al . , 1986; Ray et al . , 1980). 
Environmental agents also influence the risk of developing 
multiple sclerosis as demonstrated by migration studies. 
Individuals that migrate after age 15 carry the risk foe 
developing MS associated with their geographic origin while 
individuals who migrate earlier in life acquire the risk of 
the geographical region to which they migrated (Kurtzke, 
1985). These studies are consistent with the hypothesis that 
a group of pathogens that are relatively ubiquitous in a 
certain geographic region influence the risk of developing 
multiple sclerosis (MS). The mechanism(s) leading to clonal 
expansion of MBP reactive T cells remain to be identified but 
could involve recognition of viral peptides with sufficient 
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structural similarity to the immunodominant MBP peptide The 
initiation of autoimmunity by such a mechanism could then lead 
to sensitization to other CNS self antigens by determinant 
spreading (Lehmann et al., 1992; Kaufman et al. 1993; Tisch et 
al., 1993). Consonant with this hypothesis, it has been noted 
that inflammatory CNS disease can follow infection with a 
number of common viral pathogens, such as measles and 
rubella. On the other hand, the absence of virus in the CNS 
of these patients and reactivity to myelin basic protein. in 
these patients suggest an autoimmune mechanism (Johnson~et 
al-, 1984). 

Efforts to identify sequence homologies between self 
peptide epitopes that might be involved in autoimmunity and 
various bacterial and viral pathogens have therefore been 
made. These homology searches have focused on alignments with 
sequence identity. No success has been reported using such 
alignments in identifying epitopes from pathogens that could 
cross react with presumably pathogenic T cell lines from human 
patients with autoimmune disease (Oldstone, 1990). A sequence 
identity was recently found between an epitope in 'a Coxsackie 
virus protein and GAD65, suspected of being an autoantigen in 
diabetes. These peptides could reciprocally generate 
polyclonal T cell lines from mice that cross react with the 
other peptides (Tian, et al., 1994). No evidence, however, 
was provided that these peptides could stimulate clones from 
diabetic mice (or humans). 

Recent developments in the field, in particular the 
identification of allele specific peptide binding motifs have 
transformed the field (Madden et al., 1991; Rbtschke & Falk, 
1991). Based on this knowledge the structural basis for MHC 
linked susceptibility to autoimmune diseases can be reassessed 
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at a level of detail sufficient for solving longstanding 
questions in the field. Motifs for peptide binding to several 
MHC class I and class II molecules have been defined by 
sequence analysis of naturally processed peptides and by 
mutational analysis of known epitopes. MHC class I bound 
peptides were found to be short (generally 8-10 amino acids 
long) and to possess two dominant MHC anchor residues; MHC 
class II bound peptides were found to be longer and more 
heterogeneous in size (Madden et al . , 1991; Rdtschke & Falk, 
1991; Jardetzky et al. 1991, Chicz et al. 1992, 1993). - Due to 
the size heterogeneity, however, it has proven more difficult 
to define MHC class II binding motifs based on sequence 
alignments. More recently, a crystal structure for HLA-DR1 
demonstrated that there is a dominant hydrophobic anchor 
residue close to the N-terminus of the peptide and that 
secondary anchor residues are found at several other peptide 
positions (Brown et al . , 1993). Even this work, however, 
could not provide a detailed description of the binding 
pockets of HLA-DR proteins, the particular residues involved 
in the formation of these pockets or the structural 
requirements or antigens for MHC binding. 

In the present disclosure, a detailed description of the 
\ HLA-DR antigen binding pockets is provided (Stern et al-, 
1994). With this information, together with functional 
information defining those amino acids of the self or non-self 
antigen that are needed for MHC binding and TCR contact (e.g., 
Wucherpfennig et al. 1994, 1995) , binding motif s for. the * 
various HLA-DR allotypes may be developed, self epitopes 
involved in autoimmune disease may be identified and a method 
is provided for identifying bacterial and viral epitopes which 
may initiate a human autoimmune response. 
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Summary of the Invention 
The present invention provides, in one aspect, seven 
different isolated polypeptides derived from the human 
desmoglein 3 protein and implicated as self epitopes in the 
autoimmune disease pemphigus vulgaris (PV) . These peptides 
consist essentially of the seven amino acid sequences 
disclosed herein and designated SEQ ID NO.: 1 through and SEQ 
ID NO. : 7. In particular, the invention provides isolated 
peptides which consist of these sequences, the core MHC 
binding residues of these sequences, or the inner core MHC 
binding residues of these sequences. 

The present invention provides, in another aspect, eight 
different isolated polypeptides derived from human pathogens 
but implicated in the aetiology of the autoimmune disease 
multiple sclerosis. These peptides consist essentially of the 
eight amino acid sequences disclosed herein and designated SEQ 
ID NO. : 8 through SEQ ID NO.: 15. In particular, the 
invention provides isolated peptides which consist of these 
sequences, the core MHC binding residues of these sequences , 
or the inner core MHC binding residues of these sequences. 

In another set of embodiments, the invention provides for 
pharmaceutical preparations for use in tolerizing individuals 
to autoantigens. The preparations include a pharmaceutical^ 
acceptable carrier and an isolated human polypeptide which 
includes an amino acid sequence corresponding to a sequence 
motif for an HLA-DR protein which is associated with a human 
autoimmune disease. These polypeptides are capable of binding 
to the HLA-DR protein to form a complex which activates 
autoreactive T cells in subjects having the autoimmune 
disease. The peptides are not derived from human collagen or 
human myelin basic protein. 
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In particular embodiments, such pharmaceutical 
preparations are provided in which the HLA-DR protein is 
HLA-DR4 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, a particular sequence motif is 
provided for pemphigus vulgaris and pharmaceuticals having 
peptides with this motif are provided. Specific embodiments 
of the pharmaceuticals include each of the polypeptides 
described above with respect to pemphigus vulgaris. Thus, 
methods of tolerizing an individual to a pemphigus vulgaris 
autoantigen are also provided. 

In another set of embodiments, the invention provides for 
pharmaceutical preparations for use in tolerizing individuals 
to antigens of human pathogens which are implicated in human 
autoimmune disease. The preparations include a 
pharmaceutical^ acceptable carrier and an isolated human 
pathogen polypeptide which includes an amino acid sequence 
corresponding to a sequence motif for an HLA-DR protein which 
is associated with a human autoimmune disease. These 
polypeptides are capable of binding to the HLA-DR protein to 
form a complex which activates autoreactive T cells in 
subjects having the autoimmune disease. 

In particular embodiments, such pharmaceutical 
preparations are provided in which the HLA-DR protein is 
HLA-DR2 protein and the autoimmune disease is multiple 
sclerosis. In addition, three particular sequence motifs are 
provided for multiple sclerosis and pharmaceuticals having 
peptides with at least one of these motifs are provided. 
Specific embodiments of the pharmaceuticals include each of 
the polypeptides described above with respect to multiple 
sclerosis. Thus, methods of tolerizing an individual to a 
multiple sclerosis foreign antigen are also provided. 
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In another aspect of the invention, pharmaceuticals are_ 
provided for vaccination against a human pathogen implicated 
in the aetiology of autoimmune disease. These pharmaceutical 
preparations include a pharmaceutical ly acceptable carrier and 
an immunogenic preparation effective to immunize against a 
human pathogen. The human pathogen is one which in its native 
form includes a polypeptide having an amino acid sequence 
corresponding to a sequence motif for an HLA-DR protein which 
is associated with the autoimmune disease. These polypeptides 
are capable of binding to the HLA-DR protein to form a complex 
which activates T cells which become autoreactive and intiate 
the autoimmune disease. The preparations of the present 
invention specifically do not include such polypeptides but, 
rather, include other antigens from the pathogen. 

In particular embodiments, such pharmaceutical 
preparations are provided in which the HLA-DR protein is 
HLA-DR 4 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, a particular sequence motif is 
provided for pemphigus vulgaris and pharmaceuticals which lack 
peptides having this motif are provided. Specific embodiments 
of the pharmaceuticals include preparations lacking each of 
the polypeptides described above with respect to pemphigus 
vulgaris. Thus, methods of immunizing an individual against 
pathogens which may cause pemphigus vulgaris are also provided. 

Similarly, pharmaceutical preparations are provided in 
which the HLA-DR protein is HLA-DR2 protein and the autoimmune 
disease is multiple sclerosis. Three particular sequence 
motifs are provided for multiple sclerosis and pharmaceuticals 
which lack peptides having any of these motifs are provided. 
Specific embodiments of the pharmaceuticals include 
preparations lacking each of the polypeptides described above 
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with respect to multiple sclerosis. Thus, methods of 
immunizing an individual against pathogens which may cause 
multiple sclerosis are also provided. 

The pharmaceutical preparations for immunizing against 
pathogens that may cause multiple sclerosis may particularly 
include inactivated forms of the pathogen in which the 
polypeptide corresponding to the motif are removed or 
recombinantly altered. Thus, the invention provides such 
vaccines in which the pathogen and polypeptide are, 
respectively, Herpes simplex virus and UL15 protein, Herpes 
simplex virus and SEQ ID NO.: 8, Adenovirus and Adenovirus ORF 
protein, Adenovirus and SEQ ID NO.: 09, Pseudomonas aeruginosa 
and phosphomannomutase protein, Pseudomonas aeruginosa and SEQ 
ID NO.: 10, Papillomavirus and L2 protein, Papillomavirus and 
SEQ ID NO.: 11, Epstein-Barr virus and DNA polymerase protein, 
Epstein-Barr virus and SEQ ID NO.: 12, Influenza virus and 
hemagglutinin protein, Influenza virus and SEQ ID NO.: 13, 
Reovirus and sigma 2 protein, Reovirus sand SEQ ID NO. : 14, 
Herpes simplex virus and DNA polymerase, and Herpes simplex 
and SEQ ID NO.: 15. 

The present invention also provides general methods for 
evaluating a peptide for an ability to induce an autoimmune 
response. These methods involve choosing an MHC HLA-DR 
molecule associated with the autoimmune response, selecting at 
least two major MHC binding pockets of the HLA-DR molecule, 
identifying sets of amino acid residues which bind within each 
of the selected pockets, developing a sequence motif -for the 
HLA-DR molecule in which the sets of amino acids define the 
allowed amino acids at the corresponding positions of the 
motif, and then comparing the amino acid sequence of the 
peptide to the sequence motif. Peptides which match the motif 
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have a much greater likelihood of inducing the autoimmune - 
disease. In addition, if there is a known epitope implicated 
in the disease, the method may further include selected at 
least one TCR contact residue of the epitope, identifying a 
set of amino acid residues which may serve as the TCR contact, 
and including this set in the motif at the appropriate 
position. In preferred embodiments, the motifs include 
restrictions on the residues at positions corresponding to at 
least the PI MHC binding pocket and at least one of the P4 and 
P6 pockets. 

In another embodiment of the invention, methods are 
provided specifically for identifying foreign antigens 
implicated in human autoimmune response. These methods 
include the same steps as the previously described methods, 
but further include a comparison of the resulting sequence 
motif to sets of human pathogens. In preferred embodiments, 
peptide sequences from one or more species in the normal human 
intestinal flora are excluded from consideration. In another 
preferred embodiment, sequences from one or more species of 
pathogen which is negatively correlated with the incidence of 
the disease are excluded. In a most preferred embodiment, the 
human pathogen peptides are searched and evaluated on a - 
computer database using the motif as a search criterion. 

Brief Description of the Drawings 
Figure 1 illustrates the pockets in the HLA-DR1 peptide 
binding. The upper right shows a top view of the molecular 
surface (1.5 Angstrom probe radius) of the HLA-DR1 
peptide-binding site with the Ca trace of the HLA-DRl shown 
as thin lines. Pockets that accomodate the peptide side 
chains of an antigen (in this case Influenza virus 
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hemagglutinin) are shown in detail in surrounding views<_and. 
are numbered as described herein. The PI pocket accomodates 
the Tyr (308) of the antigen. The P4, P6, P7 and P9 pockets 
bind antigen residues Gin (311), Thr (313), Leu (314) and Leu 
(316) respectively. Antigen peptide side chains and the 
nearby peptide main chain are represented by CPK models and 
HLA-DR side chains in contact with the peptide are indicated 
as stick models. Pockets are viewed in the plane of the 
peptide binding site, towards the N terminus of the peptide 
(PI and P6), towards the C terminus of the peptide (P4)~, 
towards the B helical region (P7) or towards the 
helical region (P9). A full color version of this figure may 
be found in Stern et al., 1994. 

Detailed Description of the Invention 
The present invention relates to a method of identifying 
and evaluating peptides for their ability to induce an 
autoimmune response or to cause autoimmune disease. In 
particular, the invention relates to methods of (1) evaluating 
self peptides for their potential involvement in autoimmune 
disease when the self epitope or autoantigen is unknown and 
(2) evaluating foreign peptides for their possible involvement 
in the aetiology of autoimmune disease. The invention also 
relates to specific peptides identified by the methods of the 
invention and representing self and foreign antigens 
implicated in, respectively, pemphigus vulgaris and multiple 
sclerosis. 

The methods rely upon the development of amino acid 
sequence motifs to which potential self or foreign epitopes 
may be compared. Each motif describes a finite set of amino 
acid sequences in which the residues at each (relative) 
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position may be (a) restricted to a single residue, (b) - 

allowed to vary amongst a restricted set of residues, or (c) 

allowed to vary amongst all possible residues. For 

consistency in this disclosure, but without limiting the 

invention in any way, these sequence motifs will be symbolized 

as strings of characters in which (a) a position which is 

restricted to a single residue will be represented by the 

one-letter abbreviation for that residue, (b) a position which 

is allowed to vary amongst a set of residues will be 

represented by a column of the one-letter abbreviations" for 

those residues, and (c) a position which is allowed to vary 

amongst all amino acid residues will be represented by an 

"X." As an example only, a motif might specify that the 

residue at a first position may be any one of the residues 

valine, leucine, isoleucine, methionine, or phenylalanine; 

that the residue at the second position must be histidine; 

that the residue at the third position may be any amino acid 

residue; that the residue at the fourth position may be any 

one of the residues valine, leucine, isoleucine, methionine, 

phenylalanine, tyrosine or tryptophan; and that the residue at 

the fifth position must be lysine. Such a motif would be 

represented by the following character string: 

V H X V K 
L L 
I I 
M M 
F F 

Y 

W 

In one aspect of the present invention, sequence motifs 
are developed by analysis of the binding domains or binding 
pockets of major histocompatibility complex HLA-DR proteins 
and/or the T cell receptor ("TCR") contact points of epitopes 
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bound to MHC molecules. By providing a detailed structural, 
analysis of the HLA-DR residues involved in forming the MHC 
binding pockets, one is enabled to make predictions of 
sequence motifs for binding to any of the HLA-DR proteins. 

In another aspect of the present invention, sequence 
motifs developed by the methods disclosed herein may be used 
to identify self peptide epitopes involved in an autoimmune 
response when the autoantigen is known or suspected. 

In another aspect of the present invention, methods of 
identifying foreign peptide epitopes implicated in autoimmune 
disease are provided. These methods involve the use of MHC 
and/or TCR binding motifs to identify peptides derived from 
certain classes of organisms or pathogens which may initiate 
human autoimmune response. In this aspect, the motifs may be 
developed according to the methods of the present invention or 
by other means known in the art. 

Using these sequence motifs as search, evaluation, or 
design criteria, one is enabled to identify classes of 
peptides which have a reasonable likelihood of binding to a 
particular MHC molecule and of interacting with a T cell 
rec eptor to induce T cell and/or autoimmune response. Use of 
these motifs, as opposed to pure sequence homology (whiGh 
excludes many peptides which are antigenically similar but 
quite distinct in sequence) or sequence homology with 
unlimited "conservative" substitutions (which admits many 
peptides which differ at critical highly conserved sites), 
represents a significant advance in the ability of oae of 
ordinary skill in the art to evaluate particular peptides for 
potential involvement in autoimmune disease and to search 
computer databases of peptide sequences to identify self and 
foreign peptides which may be involved in autoimmune 
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response. In addition, the use of MHC and/or TCR binding _ 
motifs to search limited databases for foreign peptides which 
may be implicated in the aetiology of autoimmune disease is a 
novel application of the concept of binding motifs. 

Detailed examples of the practice of the present invention 
are presented below. The methods of the invention have now 
been used to identify the previously unknown self peptide 
epitopes involved in the autoimmune disease pemphigus 
vulgaris. In addition, the methods have been used to identify 
a class of foreign peptides which may be involved in the 
initiation of autoimmune response in multiple sclerosis (for 
which the self epitope had previously been identified). 

Thus, in another embodiment, the invention provides these 
peptides, in isolated form, which may be used in various 
diagnostic and therapeutic methods and products alluded to 
below . 

I . The MHC Class II HLA-DR Molecular Mimicry Motif 

The HLA-DR binding site is characterized by five major 
pockets which may bind the amino acid side chains of antigens 
(Stern et al . , 1994, the entire disclosure of which is 
incorporated herein by reference). See Figure 1. The amino 
acid residue of the antigen which binds in the first major 
pocket is designated PI . The remaining residues may then be 
numbered by their positions relative to PI (with positive 
numbers increasing toward the carboxy terminus and negative 
numbers increasing toward the amino terminus): 



P-i . . . P-l Pl P2 P3 P4 . . . Pj 
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Thus, the first major pocket of an HLA-DR molecule, by 
definition, binds the side chain of residue PI on an antigen. 
The remaining major pockets bind residues P4 , P6, P7 and P9 . 
These residues are defined as the major MHC contact residues. 

The amino acid side chains of residues P-l, P2 , P3 , P5, 
P8, and Pll are oriented away from the HLA-DR binding site 
and, therefore, are available as contact residues for a T cell 
receptor (TCR) - All of these residues are defined as TCR 
contact residues. 

A. The MHC Contact Residues 

The first major pocket of the HLA-DR molecule is strongly 
hydrophobic. It is formed by a stretch of residues at about 
positions 85, 86, 89 and 90 of the fi chain, a stretch of 
residues at about positions 31, 32 and 34 of the a chain, 
and side chains from residues at about positions 7 and 43 of 
the a chain. For example, in HLA-DR 1 (DRA, DRB1*0101), the 
first pocket is formed by residues B85 (Val), B86 (Gly), B89 
(Phe), B90 (Thr), a31 (lie), <x32 (Phe), a34 (Phe), oc7 
(He), and a43 (Trp) . The corresponding residues for other 
HLA-DR alleles are known in the art (see, e.g., Marsh and 
Bodmer, 1992, incorporated by reference herein) and are 
available through genetic databases. 

Although most of the residues that shape the PI pocket are 
from the highly conserved DRa chain, the size and nature of 
this pocket varies due to polymorphisms in the B chain 
residues involved in the pocket. For the DRB1*0101 protein, 
the pocket is large and hydrophobic and can accommodate any of 
the aliphatic or aromatic residues. Polymorphism at the B 
residues, however, may alter the binding capacity of the PI 
pocket. For example, the B86 residue is known to be 
polymorphic. Most commonly, this site is occupied by either 
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Gly or Val. Generally, when Gly is present at B86 (as in _ 
DRB1*0101), any of the aliphatic or aromatic residues m~.y bind 
within the pocket. When Val is present, however, the pocket 
is smaller and Tyr and Trp cannot be accommodated. Thus, when 
B86 is Gly, position PI of the molecular mimicry motif may 
consist of residues chosen from V, L, I, A, M, F, Y, W and 
when 1386 is Val, position PI of the motif may consist of 
residues chosen from V, L, I, A, M, F. Similar considerations 
apply to the other B residues of the PI pocket. 

The P4 pocket of HLA-DR molecules is also a relatively 
large, shallow, hydrophobic pocket oriented across the antigen 
binding site. This pocket can bind a variety of large 
aliphatic side chains which can maintain hydrophobic 
interactions along the side and floor of the pocket. The 
pocket is formed by a stretch of residues at about positions 
70, 71, 74 and 78 of the B chain, and side chains from 
residues at about position 13 of the B chain and about 
position 9 of the a chain. For example, in HLA-DR1 (DRA, 
DRBl*010l), the P4 pocket is formed by residues B70 (Gin), B71 
(Arg) , B74 (Ala), B78 (Tyr), B13 (Phe), and <x9 (Gin). The 
corresponding residues for other HLA-DR alleles are known in 
the art (see, e.g., Marsh and Bodmer, 1992) and are available 
through genetic databases. 

Like the PI pocket, the P4 pocket is largely hydrophobic 
but its binding capacity is affected by polymorphisms at the B 
residues involved in the pocket. For example, different DR 
alleles have differently charged residues at position B71 :' In 
DRB1*0404, 871 is occupied by a positively charged Arg residue 
whereas in DRB1*0402 B71 is a negatively charged Glu residue. 
Thus, although this pocket can generally bind a variety of 
aliphatic or aromatic side chains (e.g., V, L, I, A, M, F, Y, 



H0498/7015 
5963a 



-17- 



W) , positively charged P4 antigen residues are disfavored when 
G71 is positively charged and, similarly, negatively charged 
P4 residues are disfavored when (371 is also negative. Similar 
considerations apply to the other B residues of the P4 
pocket. Note that some residues may be involved in forming 
each of two adjacent pockets (e.g., 1313 in the P4 and P6 
pockets) and therefore the occupancy of one of these pockets 
by a particular amino acid may influence the occupancy of the 
other . 

The P6 pocket of HLA-DR molecules is a relatively shallow 
pocket with a preference for smaller (e.g., A, G) P6 antigen 
residues. The pocket is formed by the highly conserved all, 
a62, a65 and a66 residues and the highly polymorphic fill 
and B13 residues of the HLA-DR protein. For example, in 
HLA-DRl (DRA, DRBl*0l0l), the P6 pocket is formed by residues 
all (Glu), a62 (Asn), a65 (Val), a66 (Asp), fill (Leu) 
and G13 (Phe). The corresponding residues for other HLA-DR 
alleles are known in the art (see, e.g., Marsh and Bodmer, 
1992) and are available through genetic databases. 

Although there are only two fi chain residues in the P6 
pocket, they vary widely amongst the DR alleles. With a large 
Phe residue at (313 (as in DRBl*010l), the P6 residue is, 
preferably one of the small residues (e.g., A, G) . In other 
DR alleles, however, 613 is occupied by smaller or more polar 
residues such as the B13 (His) of DRB1*0401. For such 
alleles, the P6 motif may include somewhat larger and polar 
residues (e.g., S, T, V) but should still avoid the largest 
and aromatic residues. Finally, in some alleles, 1311 and B13 
are both serine residues (e.g., DRB1*1101) and for these cases 
more hydrophilic or hydrogen bonding residues may be included 
in the motif. 
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The P7 pocket of HLA-DR molecules is also a relatively _ 
shallow pocket. The pocket is formed by five residues of the 
B chain: B28, B47, B61, 867 and B71. For example, in HLA-DR 1 
( DRA , DRB*0101), the P7 pocket is formed by residues B28 
(Glu), B47 (Tyr), B61 (Trp) , B67 (Leu) and B71 (Arg) . The 
corresponding residues for other HLA-DR alleles are known in 
the art (see, e.g., Marsh and Bodmer, 1992) and are available 
through genetic databases. This pocket does not appear to 
contribute greatly to the specificity of HLA-DR 1 but may m be 
important in other alleles. 

The P9 pocket of HLA-DR molecules is generally a small 
hydrophobic pocket and, therefore, small hydrophobic residues 
are preferred at the P9 position of the antigen. This pocket 
is formed by the conserved a chain residues a69, a72, 
a73 and a76 and by the polymorphic B chain residues B9 and 
B57. For example, in HLA-DR1 (DRA, DRB1*0101) the P9 pocket 
is formed by a69 (Asn), a72 (He), a73 (Met), a76 
(Arg), B9 (Trp) and B57 (Asp). The corresponding residues for 
other HLA-DR alleles are known in the art (see, e.g., Marsh 
and Bodmer, 1992) and are available through genetic databases. 

The P6, P7 and P9 pockets appear to be less important than 
the Pi and P4 pockets in binding to DR molecules but they may 
be more important in binding to other isotypes (e.g., the P9 
pocket of DQ may be important). 

B. The TCR Contact Residues 

When there is no known or suspected antigen involved in an 
autoimmune response, the positions of the sequence motif 
corresponding to the TCR contact residues may be left . 
unrestricted. That is, absent a known or suspected antigen, 
the TCR contact positions of the motif are preferably allowed 
to vary amongst all of the amino acids. 
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When, on the other hand, there is a known or suspected _ 
antigen involved in an autoimmune response, at least some of 
the motif positions corresponding to the TCR contact residues 
may be restricted according to the sequence of the antigen. 
Thus, for example, the P2 and/or P3 and/or P5 positions of the 
motif may be restricted to only those residues found at the 
corresponding positions of the antigen. Alternatively, at 
least some of the TCR contact residues of the motif may be 
restricted not just to the corresponding residues of the . 
antigen but may be allowed to vary amongst similarly charged 
and/cr structurally similar residues (e.g., K and R) . It 
should be noted, however, that greater conservatism with 
respect to the TCR contact residues of the motif is justified 
by the presumably greater specificity of TCR binding relative 
to the known promiscuity of MHC binding. 

C . Deve loping an HLA-DR Sequence Motif 

Given the present disclosure of the HLA-DR residues 
involved in the formation of the PI, P4 , P6, P7 and P9 MHC 
binding pockets, and given the nucleotide or corresponding 
amino acid sequence of any particular HLA-DR allele, one is 
now enabled to develop a sequence motif useful in evaluating 
or predicting the ability of peptides to bind to that MHC 
protein. When a particular antigen is known to (or is 
suspected of) binding to the MHC protein, the TCR contact 
residues of that antigen may also be considered in the motif. 

The method first requires the selection of two or more of 
the MHC binding pockets for which the choice of peptide 
residues will be restricted at the corresponding positions of 
the motif. One may select all five of the major binding 
pockets and develop a motif in which the corresponding five 
positions of the motif are restricted or one may select fewer 
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and develop a less restricted motif. As will be obvious- to-- 
one of ordinary skill in the art, a more restricted motif will 
identify a lesser number of peptides in a database search and 
a less restricted motif will identify a greater number of 
peptides. In all instances, at least two of the major binding 
pockets should be selected. When fewer than all five MHC 
binding pockets are selected, it is preferred that at least 
one is PI and that a second is chosen from P4, P6 and P9 . 

Either before or after the pockets to be restricted by the 
motif are selected, the set of amino acid side chains likely 
to bind within each of those pockets and, therefore, the set 
of amino acid residues that will define the corresponding 
positions of the motif, must be determined. This may be 
accomplished by one of ordinary skill in the art by 
considering the amino acid residues which form the pocket. 
These residues, identified in Section A above, will determine 
the size and nature (i.e., hydrophobic, hydrophilic, 
positively charged, negatively charged, uncharged) of the 
pocket and consequently, the side chains which may bind within 
the pocket. Reference may be had to Figure 1 during these 
considerations but will become increasingly unnecessary as one 
develops familiarity with the variations of the pockets r 

As a general matter, in light of the identification of the 
residues forming the MHC binding pockets of the HLA-DR 
proteins disclosed herein, one of ordinary skill in the art 
can easily develop a sequence binding motif for any HLA-DR 
protein for which these residues are known for two or. more 
binding pockets. The major considerations are size, 
hydrophobicity and charge. In light of the present 
disclosure, each of these considerations may be addressed 
according to well-known principles. A baseline is disclosed 
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herein for each pocket for the DRB1*0101 allele. Relative to 
this HLA-DR protein, one of ordinary skill is enabled to 
develop motifs for other HLA-DR alleles. Thus, substitutions 
which lead to larger /smal ler pockets suggest that the 
corresponding motif positions should be restricted so as to 
permit smaller/larger residues. Similarly, more/less 
hydrophobic pockets suggest that the corresponding motif 
positions should be restricted to more/less hydrophobic 
residues. Finally, positively/negatively charged pockets 
suggest that positively/negatively charged residues should be 
excluded and negatively/positively charged residues may be 
included at the corresponding motif positions. As noted 
above, the present disclosure enables one of ordinary skill to 
develop motifs based upon well-established principles. 

For example, and not by means of limitation, consider the 
PI pocket of the HLA-DR protein. The residues forming this 
pocket in the DRB1*0101 were described above. For DRB1*0101, 
the PI pocket is large and hydrophobic and can accommodate any 
of the aliphatic or aromatic residues (e.g., V, L, I, A, M. F, 
Y, W) . For the DRBl*1602 protein the same is true. On the 
other hand, in the DRB1*1501 protein, the B86 position is 
occupied by Val instead of the Gly found in DRB1*0101 and 
DRB1*1602. This substitution decreases the size of the PI 
pocket in this MHC protein and, as a result, the pocket cannot 
easily accommodate Tyr or Trp side chains. Thus, for 
DRB1*1501, the sequence motif at position PI may be restricted 
to residues chosen from V, L, I, A, M and F. 

Similarly, in light of the present disclosure, one of 
ordinary skill in the art may consider each of the MHC binding 
pockets, or only selected pockets, and develop a sequence 
motif for any HLA-DR protein for which the residues involved 
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in pocket formation are known. These residues will determine 
both the size and nature of the pocket and, thereby, the size 
and nature of the residues which may bind within it. When the 
pocket is relatively small, the largest amino acid residues 
(e.g., Y, W) may be excluded from the corresponding position 
of the motif and, when the pocket is charged, amino acid 
residues of the same charge may be excluded. 

If a self or foreign epitope involved in immune response 
is known or suspected, and particularly if its TCR contact 
residues can be defined through the use of responsive T'cell 
clones, the TCR contact residues of the epitope may also be 
considered in developing a sequence motif. As with the MHC 
contact residues, all or merely some of the TCR contact 
residues may be restricted in the motif. And, as with the MHC 
positions, the restriction of more positions {or the greater 
restriction of any one position) will result in the 
identification of fewer peptides in a database search. Unlike 
the MHC contact residues, for which at least two positions 
should be restricted in the motif, it is acceptable to omit 
any restrictions of TCR contact residues in the motif. 

If any TCR contact residue positions are restricted in the 
sequence motif, it is preferred that a position selected, from 
positions P2, P3 and P5 be chosen. Because, in contrast to 
the relative promiscuity of MHC binding pockets, TCR contact 
residues appear to have greater specificity, it is preferred 
that any TCR contact residue positions which are restricted in 
the motif be rather narrowly restricted. That is, it is 
preferred that such positions be restricted to just the 
residue found at the corresponding position of the known 
antigen or just to residues which are highly similar in 
structure and charge. 
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For example, and as further detailed below, the MBP _ _ 
(85-99) peptide is known to be the autoantigen implicated in 
multiple sclerosis. The P3 residue of this peptide is Phe 
(residue 91 of MBP) and the P5 residue is Lys (residue 93 of 
MBP). Thus, if P3 is restricted in the motif, it is preferred 
that it be restricted to F or, possibly, to F and Y. 
Similarly, if P5 is restricted, it is preferred that it be 
restricted to only K or to K and the similarly charged R. 
Alternatively, P3 and P5 may be left unrestricted. 

Obviously, MHC and TCR positions not selected for 
restriction may be represented by, in the notation of this 
disclosure, an X. Similarly, as shown in the examples below, 
several motifs may be developed with varying numbers of 
positions restricted to varying extents. 

I I . Identifying Self Epitopes Using Sequence Motifs 

An ever increasing number of autoimmune diseases are now 
being associated with particular alleles of the MHC Class II 
HLA-DR locus. For most of these autoimmune diseases, the self 
epitope remains unknown. For some, however, a self protein 
involved in autoimmune response is known or suspected. 

In one aspect of the present invention, a method is - 
provided for identifying the self epitopes involved in 
autoimmune diseases associated with HLA-DR alleles. That is, 
by comparing human peptide sequences to the sequence motifs of 
the present invention, one is enabled to identify those 
peptides which have the highest likelihood of being the self 
epitopes involved in the disease. 

The method may be applied to any of the autoimmune 
diseases for which an association with a particular HLA-DR 
allele is known and for which the amino acid residues forming 
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the mc binding pockets (or at least two of the major pockets) 
for that allele are known. In accordance with the method 
discussed herein, one may then develop one or more sequence 
motifs for the HLA-DR protein associated with the disease. 
Naturally, if the disease is associated with two or more 
alleles, motifs may be developed for the two or more HLA-DR 
proteins and, in particular, a consensus motif may be 
developed using the residues shared by each position of these 
motifs. 

The sequence motif or motifs thus developed are then 
compared to appropriate sets of human peptide sequences. The 
human peptide sequences may include all known human sequences 
or may be limited in ways which will be obvious to one of 
ordinary skill in the art. For example, if the disease is 
restricted to particular tissues, the search may be limited to 
peptides found in those tissues. Conversely, peptides which 
are also found in unaffected tissues may be eliminated from 
the search pool. In the most extreme case, when the 
autoantigen is known or suspected but the particular epitope 
is unknown, the search may be limited to sequences within the 
autoantigen (see Example 1). 

This method may be used to identify a set of peptides 
which match the motif and which are most likely to be self 
epitopes. By varying the number of positions restricted by 
the motifs, and/or the extent of restriction at each position, 
and/or the size of the search pool, the number of peptides in 
the set will, in all likelihood, also be varied. As noted' 
above, at least two of the MHC contact positions (e.g,, PI and 
P4) should be restricted. Depending upon the number of 
peptides in the resultant set, a more or less restrictive 
motif may then be employed to reduce or expand the set. The 
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desirec size of the resultant set depends, of course, ugon the 
subsequent intentions of the practitioner of this method. 

Once a set of peptides has been identified, these peptides 
may optionally be screened for activity. The choice of such 
screens is at the discretion of the practitioner and beyond 
the scope of the present invention. Preferred screens, 
however, include in vitro tests for the ability to induce the 
proliferation of autoreactive T cells or to induce the 
secretion of lymphokines (cytokines) from these T cells or to 
induce other effector functions such as cytotoxicity. In some 
circumstances, human in vivo tests may be appropriate and in 
other circumstances animal models of the human disease may be 
avai 1 able . 

Ill . Identifying Foreign Epitopes Implicated in Human 
Autoimmune Disease 

As noted in the Background section, epidemiological 
evidence has suggested that various bacterial and viral 
pathogens may be implicated in human autoimmune diseases and 
the concept of molecular mimicry pervades the literature 
(reviewed by Oldstone, 1990). Prior attempts to identify 
particular foreign epitopes involved in human autoimmune, 
disease, however, have depended upon direct sequence 
similarity to known human epitopes. The results have been 
disappointing and, to date, no pathogens or peptides derived 
from pathogens have been shown to be a primary cause of human 
autoimmune disease. 

Thus, in another aspect of the present invention, a method 
of identifying foreign epitopes implicated in human autoimmune 
disease is provided. That is, for the first time a method of 
identifying such foreign epitopes is provided which employs 
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sequence motifs to identify foreign peptides which have the _ 
highest likelihood of being involved in the aetiology of human 
autoimmune disease . 

The method may be applied to any of the autoimmune 
diseases for which an association with a particular MHC 
protein is known and for which either (1) a sequence motif has 
been developed by prior art methods or (2) a sequence motif 
may be developed by the methods of the present invention. 
When the self epitope is known or suspected, TCR contact . 
residues may be included in the motif. As before, one or more 
motifs may be employed and differently derived motifs may be 
combined to develop consensus motifs. 

The sequence motif or motifs thus developed may then be 
compared to appropriate sets of peptide sequences derived from 
human pathogens. This is most conveniently accomplished using 
genetic databases widely available to those of skill in the 
art. In a most preferred embodiment, the search pool is 
limited in one or more of the following ways: (1) only 
sequences from human bacterial or viral pathogens are 
included; (2) sequences from the normal human intestinal flora 
(e.g., E. coli or other Enterobacteriaceae ) are excluded; and 
(3) sequences from pathogens are included/excluded depending 
upon whether the geographical or epidemiological incidence of 
the pathogens are positively/negatively correlated with the 
incidence of the autoimmune disease in question (see Example 
2) . 

This method may be used to identify a set of foreign 
peptides which match the motif and which are most likely to be 
involved in the human disease. As before, the number of 
peptides in the set can be varied by using more or less 
restrictive motifs and/or by varying the search pool. And, as 
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before, the resultant set of peptides may subsequently be 
subjected to any of a variety of known screens for activity. 

IV. Self and Foreign Epitopes Identified by the Methods of 
the Present Invention 

As detailed in the examples below, the methods of the 
present invention have been employed (1) to identify seven 
self epitopes of the desmoglein 3 protein implicated in 
pemphigus vulgaris and (2) to identify eight foreign epitopes 
from human pathogens implicated in multiple sclerosis. 

Each of these peptides is fifteen residues in length, 
partly as a result of the computer database search program 
used (Genetics Computer Group program "Findpatterns" ) but also 
corresponding to the size of the cleft in. MHC class II 
molecules. The fifth position of each corresponds to the PI 
residue of the antigen. Thus, the P-2 to Pll residues which 
span the MHC Class II binding cleft correspond to the third 
through fifteenth residues of these sequences. The P-l to P9 
residues which are important to MHC and TCR binding correspond 
to the fourth through thirteenth positions. The most 
important residues for MHC and TCR binding, P-l to P6 
correspond to the fourth through tenth positions of theae 
sequences . 

SEQ ID NO.: 1 through SEQ ID NO.: 7, also shown in Table 1 
as PVA.l through P VA . 7 , correspond to residues 78-93, 97-111, 
190-204, 206-220, 251-265, 512-526 and 762-786 of the human 
desmoglein 3 protein. These peptides are implicated as self 
epitopes in pemphigus vulgaris. Already, as described in 
Example 1, two of these peptides have been shown to cause 
proliferation of T cells isolated from two patients with 
pemphigus . 
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SEQ ID NO. : 8 through SEQ ID NO. : 15, also shown in Table 
2 are internal fragments of the herpes simplex virus UL15 
protein, adenovirus Type 12 ORF, Pseudomonas aeruginosa 
phosphomannomutase, human Papi 1 lomavious Type 7 L2 protein, 
Epstein-Barr virus DNA polymerase, Influenza Type A 
hemagglutinin protein, Reovirus Type 3 sigma 2 protein, and 
herpes simplex DNA polymerase, respectively. These peptides 
are implicated as foreign epitopes involved in the aetiology 
or in remissions of multiple sclerosis. As detailed in . 
Example 2 below, each has been shown to be capable of inducing 
the proliferation of autoreactive T cell clones isolated from 
human multiple sclerosis patients. The sequence of the MBP 
(85-99) peptide is disclosed as SEQ ID NO.: 16. 

Each of these proteins has a variety of utilities and, 
therefore, in another aspect, the present invention provides 
each of these peptides in isolated form. In addition to the 
sequences of fifteen residues shown in the Sequence Listing 
and Tables, the present invention also embraces the fragments 
of these peptides corresponding to the MHC binding domain. In 
particular, the invention provides peptides corresponding to 
the P-2 to Pll, P-l to P9 and P-l to P6 positions of each of 
SEQ ID NO.: 1 through SEQ ID NO. : 15. As will be obvious to 
one of ordinary skill in the art, however, any fragment of any 
of these peptides which includes at least the PI and P4 or at 
least the PI and P6 or at least the P4, P6 and P7 residues may 
have utility and is intended to fall within the spirit and 
scope of the claims. In particular, longer peptides including 
these peptides or peptides including at least the MHC .binding 
and TCR contact residues described above are contemplated as 
equivalents . 
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The manner of production of these peptides is 
inconsequential but they may be isolated and purified from 
their natural sources or they may be synthesized. Because of 
their relatively short length, it is presently contemplated 
that they should be produced by synthesis. Methods of 
isolation, purification and synthesis of such peptides are 
well known in the art and need not be recited herein. 

The present invention also provides products and methods 
using other peptides which may be identified by the methods of 
the present invention. These peptides, as well as those 
disclosed above, may be used in each of the following 
embodiments . 

The peptides of the present invention may be used for in 
vitro assays to aid in the diagnosis and classification of 
pemphigus vulgaris and multiple sclerosis. For example, 
autoreactive T cells from patients with PV and MS may be 
tested, as in the examples below or by other known assays, for 
reactivity with these peptides. The ability or inability of 
these peptides to cause proliferation of the T cells will, in 
the case of pemphigus, allow for a refinement of the diagnosis 
by particular desmoglein 3 epitopes and, in the case of 
multiple sclerosis, allow further classification of the , 
disease by cross-reactive (self and foreign epitope) types. 
Immune response to these peptides prior to onset of disease 
may also be used as an indication of susceptibility or 
predisposition although care should be taken so as not to 
induce autoant igenic response. 

The peptides of the present invention may also be used in 
the development of animal models by immunizing animals (e.g., 
mice, rabbits, non-human primates) with these peptides. 
Animals which not only develop a response to the peptide but 
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which also develop an autoimmune disease corresponding to the 
human pathology will have obvious utility as models for the 
human disease. Animals which develop a response to the 
peptide without developing any corresponding autoimmune 
disease, will have utility as subjects for experiments 
involving the selective depletion of T cells or other forms of 
desensit ization or tolerizat ion . 

Importantly, these peptides and amino acid analogs of 
these peptides will have utility as therapeutic and diagnostic 
agents. The pathogens, viruses or bacteria from which they 
are derived will have utility as vaccinating agents. Some 
examples of the utility of these materials include the 
f ol lowing . 

The peptide may be administered in high doses to produce 
high dose tolerance. This process of tolerization is 
described in, for example, PCT patent application US93/08456 
(International publication number WO 94/06828). Thus, in one 
set of embodiments, the invention provides for pharmaceutical 
preparations for use in tolerizing individuals to 
autoantigens . The preparations may include a pharmaceutically 
acceptable carrier and an isolated human polypeptide which 
includes an amino acid sequence corresponding to a sequence 
motif for an HLA-DR protein which is associated with a human 
autoimmune disease. These polypeptides are capable of binding 
to the HLA-DR protein to form a complex which activates 
autoreactive T cells in subjects having the autoimmune 
disease. By using the peptides disclosed herein or identified 
by the methods of the invention, such pharmaceuticals can be 
used to combat autoimmune response. The use of such 
tolerization to human autoimmune diseases is known in the art 
and need not be elaborated upon here. Tolerizing doses of 
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collagen for rheumatoid arthritis and myelin basic protean for 
multiple sclerosis have been used. The present invention 
therefore specifically does not embrace these proteins. Other 
peptides, however, may now be identified by the present 
methods and be similarly used to treat autoimmune disease. 

In particular embodiments, such pharmaceutical 
preparations are provided in which the HLA-DR protein is 
HLA-DR4 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, using PV motif #1, pharmaceuticals 
having peptides with this motif are provided. In most 
preferred embodiments, the pharmaceuticals include at least 
one of the polypeptides of SEQ ID NO. 1 through SEQ ID NO . : 7 . 
Thus, methods of tolerizing an individual to a pemphigus 
vulgaris autoantigen are also provided. 

In a similar set of embodiments, the invention provides 
for pharmaceutical preparations for use in tolerizing 
individuals to antigens of human pathogens which are 
implicated in human autoimmune disease. The preparations 
include a pharmaceutical ly acceptable carrier and an isolated 
human pathogen polypeptide which includes an amino acid 
sequence corresponding to a sequence motif for an HLA-DR 
protein which is associated with a human autoimmune disease. 
These polypeptides are capable of binding to the HLA-DR 
protein to form a complex which activates autoreactive T cells 
in subjects having the autoimmune disease. Thus, by 
tolerizing an individual to these antigens, T cells which are 
cross-reactive with the self antigen will be renedereci 
unresponsive or anergized and protection from the disease will 
be afforded. 

In particular embodiments, such pharmaceutical 
preparations are provided in which the HLA-DR protein is 
HLA-DR2 protein and the autoimmune disease is multiple 
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sclerosis. In addition, using the the three MS motifs 
disclosed herein, pharmaceuticals having peptides with at 
least one of these motifs are provided. Specific embodiments 
of the pharmaceuticals include at least one of the 
polypeptides disclosed as SEQ ID N0.:8 through SEQ ID NO.: 
15. Thus, methods of tolerizing an individual to a multiple 
sclerosis foreign antigen are also provided. 

In set of embodiments, pharmaceuticals are provided for 
vaccination against a human pathogen implicated in the 
aetiology of autoimmune disease. These pharmaceutical - 
preparations include a pharmaceut ical ly acceptable carrier and 
an immunogenic preparation effective to immunize against a 
human pathogen. The human pathogen is one which in its native 
form includes a polypeptide having an amino acid sequence 
corresponding to a sequence motif for an HLA-DR protein which 
is associated with the autoimmune disease. These polypeptides 
are capable of binding to the HLA-DR protein to form a complex 
which activates T cells which become autoreactive and intiate 
the autoimmune disease. The preparations of the present 
invention specifically do not include such polypeptides but, 
rather, include other antigens from the pathogen. That is, a 
vaccine is produced which specifically does not include 
polypeptides which correspond to the sequence motifs for the 
HLA-DR protein and, if known, the TCR contacts of the self 
epitope. Because pathogens present a wide array of antigenic 
determinants, one may eliminate those which corespond to the 
relevant sequence motif and produce a vaccine which is 
effective against the pathogen but which will not include 
peptides implicated in the autoimmune response. 

Such vaccines, lacking peptides corresponding to the 
sequence motifs of the present invention, may be made in any 
convenient manner by one of ordinary skill in the art. For 
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example, when producing an influenza vaccine, one may compare 
the peptide secjuences of the influenza virus to sequence 
motifs developed according the present invention. The vaccine 
may then be made excluding the proteins which have the motif 
sequences (e.g. by using a fragment of the virus) or 
recombinant techniques may be used to produce a virus in which 
the sequences corresponding to the motif are altered such that 
they do not match the motif. In preferred embodiments, the 
altered residues are the TCR contact residues and, in 
particular, a substitution which alters the charge of a -TCR 
contact residue is preferred. Similar vaccines can be 
developed for bacterial pathogens using, for example, only 
portions of the bacterium (e.g. bacterial surface proteins or 
membrane associated proteins) which lack a motif associated 
with human autoimmune disease or, again, genetically altering 
the vaccine bacteria so as to change residues. 

The motifs which are considered in creating such vaccines 
may be chosen on any of several grounds. If the pathogen to 
which the vaccine is directed is associated with an autoimmune 
disease, a motif may be developed according to the methods 
described herein for an HLA-DR protein associated with the 
disease. If there is also a known or suspected autoant icjen, 
the motif may include TCR contact residues of the self 
epitope. The pathogens protein complement may then be 
compared to the motif and peptides corresponding to the motif 
may be omitted from the vaccine or may be altered by 
recombinant means to produce a vaccine free of such peptides. 
Alternatively, vaccines may be developed with particular 
populations in mind. For individuals suffering from or at 
risk of developing a particular autoimmune disease, special 
vaccines may be developed. In this case, the motif is again 
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chosen on the basis of the HLA-DR protein associated with the 
autoimmune disease and, when known, the TCR contact residuesof 
the self epitope. 

In particular embodiments, such vaccine preparations are 
provided in which the HLA-DR protein is HLA-DR4 protein and 
the autoimmune disease is pemphigus vulgaris. And, in 
particular, vaccines are provided lacking a peptide 
corresponding to the PV motif ttl disclosed herein. Specific 
embodiments of the vacines include vaccines lacking at least 
one of the peptides disclosed as SEQ ID NO.: 1 through SEQ ID 
NO.: 7. Thus, methods of immunizing an individual against 
pathogens which may cause pemphigus vulgaris are also provided. 

Similarly, vaccine preparations are provided in which the 
HLA-DR protein is HLA-DR2 protein and the autoimmune disease 
is multiple sclerosis. And, in particular, vaccines lacking a 
peptide corresponding to at least one of the three MS motifs 
disclosed herein are provided. Specific embodiments of the 
vacines include vaccines lacking at least one of the peptides 
disclosed as SEQ ID NO. : 8 through SEQ ID NO.: 15. In 
particularly preferred embodiments, the vaccines are developed 
from the pathogens listed in Table 2 but lack either the 
entire protein listed as the antigen or lack at least the 
peptides identified in the corresponding sequences. Thus, 
methods of immunizing an individual against pathogens which 
may cause multiple sclerosis are also provided. 

These peptides will also be useful in assessing which 
pathogen(s) may be important in a particular patient.. For 1 
example, the T cells from one patient may proliferate in 
response to one or a few of these peptides, while those from 
another patient may proliferate in response to a different 
peptide or set of peptides. Analogs of the peptides may be 
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synthesized in which one of the T cell receptor contact _ 
residues is substituted. For example, in the case of MS, 
peptides with the substitution of MBP91F by 91A or of MBP93K 
by 93A may be employed. Such analogs are not, however, 
limited to the substitution of these primary T cell receptor 
contact residues of to substitution by particular amino acids 
such as A. These peptide analogs may be used to anergize 
(inactive) autoreactive T cells (see, for example, 
Sloan-Lancaster et al., 1993 and 1994) upon administration to 
autoimmune patients. The viruses or bacterial pathogens may 
be useful in immunization by selecting viral or bacterial 
strains that do not carry the mimicry epitope. Proteins from 
these pathogens other than those that carry the mimicry 
epitope may also be selected for immunization. This therapy 
may be useful in preventing reinfection and thus remissions of 
the disease or in preventing initial infections in 
particularly susceptible populations (the most obvious example 
of which is a disease-free identical twin of a patient). 

EXAMPLES 

1 . identification of Self Epitopes of Pemphig us Vulgaris 

As noted above, pemphigus vulgaris (PV) is, in different 
ethnic groups, associated either with a DR4 allele (DRBl*0402) 
or with a rare DQ1 allele (DQB1*05032) ; only a small fraction 
of PV patients have neither susceptibility gene (Ahmed et al . , 
1991; Ahmed et al., 1990; Scharf et al., 1988b). The PV 
associated molecule has a negative charge (Glu) at the critical 
position B71; the neighboring position (G70) is also negatively 
charged. The DR4 subtype associated with PV is the only one 
that carries a negative charge at DRfi 71 (a positive charge 
(Arg) is found at DRfi 71 in the RA associated DR4 molecules). 



H0498/7015 

5963a 



-36- 



Although polymorphic, the P7 pocket residue DRB 67 ( Leu/I le) 
does not appear to be involved in peptide binding but probably 
acts as a TCR contact residue (Stern et al., 1994). 

The charge of a polymorphic residue at DRB 71 could 
therefore account for susceptibility to two different 
autoimmune syndromes associated with structurally similar DR4 
subtypes: DR4 alleles associated with susceptibility to 
rheumatoid arthritis have a positive charge at DRB 71 (Arg) 
while the DR4 allele associated with pemphigus vulgaris has a 
negative charge at DRB 71 (Glu) . Peptides selective for 
either DR4 molecule may therefore differ significantly in 
their charge at P4 : Peptides with a negative charge at P4 
would be expected to bind to the RA associated molecules but 
not the pemphigus associated DR4 molecule; in contrast a 
positive charge would be expected for the pemphigus peptide(s) 
at position 4. Due to the conserved nature of these molecules 
other peptide anchor residues (PI and P6) would not be 
expected to be different for these DR4 subtypes. 

A sequence motif for selective binding to the HLA-DR 
DRB1*0402 protein was developed according to the method 
disclosed herein. 

The 13 chain residues involved in forming the PI pocket 
for this allele are 685 (Val), B86 (Val), B89 (Phe) and B90 
(Thr). Thus, the presence of Val at B86 (instead of Gly as in 
DRB1*0101) suggests that the PI position of the motif be 
restricted to V, L, I, M and F. Alanine might also have been 
included but was not in this example. The P6 pocket is formed 
in part by the fill (Val) and B13 (His) of the DRB1*0402 
protein. Relative to the DRB1*0101 allele, in which these 
residues are Leu and Phe respectively, the P6 pocket of the 
DRB1*0402 protein is somewhat larger and more polar. Thus, 
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for the P6 position of the motif, S, T, N and V were allowed. 
Finally, the P4 pocket of this DR protein is formed in part by 
the residues B13 (His), B70 (Asp), B7l (Glu), B74 (Ala) and 
B78 (Tyr). As noted above, the two negatively charged 
residues as B70 and B71 create a preference for positively 
charged antigen residues and, therefore, the motif for the P4 
position was restricted to K and R. 

Thus, the sequence motif for the pemphigus vulgaris 
autoantigen was defined as: 

Position PI P2 P3 P4 P5 P6 

PV Motif ttl : V X X K X S 

L R T 

I N 

M V 

F 

Although the autoantigen for pemphigus vulgaris is known, 
the precise epitopes within the autoantigen have previously 
remained unknown. Using the method of the present invention, 
however, it has been possible to identify a small set of 
peptides which may serve as the autoant igenic determinants- 
The target antigen of pemphigus vulgaris is an epithelial 
adhesion molecule of the cadherin family, desmoglein 3 (Amagai 
et al., 1991). Desmoglein 3 mediates Ca* + dependent 
adhesion between keratinocytes; the autoantibodies interfere 
with cell adhesion with resulting blister formation (Takeichi, 
1990). The autoantibodies are thought to be pathogenic since 
a transient blistering disease is also seen in newborns of 
affected mothers due to transfer of maternal immunoglobulin to 
the fetus. Transfer of serum or desmoglein 3 specific 
antibodies to mice also results in acantholysis (Amagai et 
al., 1992). 
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Only seven peptides from this large protein (130 kDa_ SS9 
amino acids) matched the motif. These seven pemphigus, 
vulgaris antigens ( PVA . 1-PVA , 7 ) are presented below in Table 1 
with the residues corresponding to the MHC binding positions 
PI, P4 and P6 underlined. 

TABLE 1 

PVA.l (res. 78-93) ATQK I TYR I SG VG ID 

PVA. 2 (res. 97-111) FG I FVVDKNTGD INI 

PVA. 3 (res. 190-204) LNSKIAFKIVSQEPA 
PVA. 4 (res. 206-220) TPMFLLSRNTGEVRT 
PVA. 5 (res. 251-265) CECNIKVKDVNDNFP 
PVA. 6 (res. 512-526) SARTLNNRYTGPYTF 
PVA. 7 (res. 762-786) QSGTMRTRHSTGGTN 

Therefore, the selective presentation of one or several of 
these peptides by the PV associated DRB1*0402 molecule to T 
cells may be critical for initiating autoimmunity in PV. To 
test this, T cell lines were raised from blood mononuclear 
cells of two patients with active disease by stimulation with 
the seven candidate peptides. T cell lines were Expanded with 
rIL-2 and tested for recognition of the candidate peptides in 
a proliferation assay. T cell lines from both patients- 
recognized two peptides from the extracellular domain of 
desmoglein 3 (PVA. 3 and PVA. 4) that were located close to the 
major autoantibody recognition site. These T cell lines were 
HLA-DR restricted as T cell proliferation was blocked by a 
monoclonal antibody specific for HLA-DR but not by a-control 
antibody. These desmoglein 3 peptides are therefore . 
candidates for the T cell dependent induction of autoimmunity 
in pemphigus vulgaris. 
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The motifs of the present invention relating to the HLA-DR 
proteins can be taken one step further. In a different ethnic 
group PV is associated with a rare DQ1 subtype (DQB1*05032) 
that differs from the common DQ1 subtype only at position 57 
of the DQB chain (Sinha, et al . , 1988). In the PV associated 
molecule DQB 57 is negatively charged (Asp) whereas in the 
common DQ1 subtype it is not. The same position on the DQB 
chain has also been implicated in susceptibility to diabetes. 
In diabetes, however, the reverse is true: DQ2 and DQ8 
molecules associated with susceptibility to diabetes do~not 
have a negative charge at DQB 57 (Todd et al., 1987). 

Based on these observations it becomes clear that two 
polymorphic positions in the MHC class II B chain (position 71 
of DRB and position 57 of DQB) are critical for selective 
peptide binding and the development of autoimmunity. Based on 
the criteria described above, a diabetes linked peptide would 
be expected to have a negative charge at P9 since such a 
peptide would only bind to DQ molecules that do not have the 
same charge at DQB 57. In contrast, for the DQ1 associated 
cases of pemphigus, a peptide with a positive charge at P9 may 
be selective for the disease associated molecule which carries 
a negative charge at DQB 57. In the case of DR4 linked , 
autoimmunity, the charge at peptide position 4 confers 
selectivity to the disease associated DR4 molecule: RA 
peptides have a negative charge at P4, PV peptides a positive 
charge at P4 . Motifs for selective peptide binding may 
therefore prove to be tremendously useful in the identification 
of key epitopes that initiate human autoimmune diseases. This 
approach is expected to be useful not only for identifying 
peptides in PV, RA or diabetes but also for other autoimmune 
diseases where residues critical in peptide binding have been 
linked to disease susceptibility. 
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2 . Identification of Mimics of the Multi ple Scl erosis M BP _ 
Autoant igen 

Susceptibility to MS is associated with HLA-DR2 (DRA , 
DRB1*1501, the most common subtype of DR2) (Spielman et al., 
1982; Olerup et al., 1989). This MHC class II molecule is 
believed to play a critical role m the immunopathogenesis of 
MS by presenting immunodominant self peptides to autoreactive 
T cells. Following injection of MBP in experimental animals, 
T cells specific for immunodominant peptides of MBP mediate an 
inflammatory response in the CNS that can be accompanied by 
marked demyel inat ion (reviewed in Zamvil and Steinman, 1990). 
In previous studies two regions of human MBP were found to be 
immunodominant (residues 84-102 and 143-168) (Ota et al . 1990; 
Pette et al. 1990; Martin et al. 1990; Wucherpf ennig et al. 
1994a). Reactivity to the MBP(84-102) peptide was 
predominantly seen in subjects carrying HLA-DR2 . Using L cell 
transf ectants as antigen presenting cells, HLA-DR2b (DRA, 
DRB1*1501) was found to serve as the restriction element for 
these MBP(84-102) specific T cell clones. 

The M3P(84-102) peptide binds with high affinity to the 
HLA-DR2b molecule with two hydrophobic residues serving as the 
anchors PI (Val 89 of MBP) and P4 (Phe 92 of MBP) 
(Wucherpf ennig et al-, 1994a; Vogt et al. 1994). At position 
PI, Val 89 can be substituted by other aliphatic amino acids 
(Leu, lie), as well as by methionine and phenylalanine; 
alanine is tolerated at this position but reduces the affinity 
of the peptide for HLA-DR2b. At position P4 , all aliphatic 
and aromatic residues are permitted; again alanine is. 
tolerated but results in a loss of binding affinity. 

A mutational analysis of putative TCR contact points 
confirms that P3 (Phe 91) and P5 (Lys 93) are TCR contacts for 
the MBP( 85-99) specific clones; other residues such as P-l 
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(Val 88) and P2 (His 90) are important for some clones but not 
for others. Substitution of P3 (Phe 91) by alanine abolishes 
TCR recognition for all clones; some clones tolerated 
conservative substitutions (eg. tyrosine or aliphatic amino 
acids) while other clones do not. Substitution of P5 (Lys 93) 
by arginine is tolerated by most T cell clones but more 
drastic changes frequently result in a partial or complete 
loss of T cell reactivity. This analysis demonstrates that P2 
(His 90), P3 (Phe 91) and P5 (Lys 93) are the primary TCR 
contact residues for MBP while PI (Val 89) and P4 (Phe 92) are 
the primary MHC contact residues for MBP. This analysis also 
confirms the present teaching that the TCR contact residues of 
a motif should be highly if not absolutely conserved. 

Based on this structural characterization of the 
immunodominant MBP( 85-99) peptide, three sequence motifs were 
developed to search a protein database for viral and bacterial 
peptides that matched these requirements. The motifs focused 
on the core region of the peptide, residues P-l to P5 (88-93 
of the MBP protein), which contained the MHC and TCR contacts 
common to all clones. In the first motif, aliphatic amino 
acids were allowed at the first MHC anchor residue PI while 
both aliphatic and aromatic residues were permitted at the 
second MHC anchor P4 . For the TCR contacts, the Phe 91 at P3 
was absolutely conserved, the Lys 93 at P5 could be 
substituted only by arginine, while the His 90 at P2 and the 
Val 88 at P-l could be substituted by several structurally 
related amino acids. Thus, the first motif for multiple 
sclerosis antigens was defined as: 
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Pos i t ion 
MS Motif ttl 



P-l 
V 
L 
I 
A 
M 



PI 

V 

L 

I 
A 
M 



P2 

H 

N 

Q 

F 



P3 
F 



P4 

F 

Y 

W 

V 

L 

I 

A 

M 



P5 

K 

R 



The second motif omitted the Val 88 P-l as a TCR contact 
residue (used only by some clones) and permitted also aromatic 
amino acids at the first MHC anchor Pi (Val 89). This was 

done since the MBP(85-99) peptide is presented by different 
HLA-DR2 subtypes: Presentation by DRB1*1501 requires an 
aliphatic amino acid or phenylalanine at this position while 
aliphatic and all aromatic residues at this position can serve 
as anchor for DRB1*1602. This difference, as explained above, 
relates to the size of the primary pocket for binding this 
hydrophobic residue and is determined by the Val/Gly 
dimorphism at DRB86 (Val in *1501 and Gly in *1602) (Busch et 
al., 1991). Thus, the second motif for multiple sclerosis 
antigens was defined as: 



Position 

MS Motif #2: 



PI 
F 

y 
w 

V 
L 
I 
A 
M 



P2 

H 

N 

Q 
F 



P3 
F 



P4 

F 

Y 

W 

V 

L 

I 

A 

M 



P5 

K 

R 



The third sequence motif represented a modification of the 
TCR contact residues preferred by a subgroup of MBP( 85-99) 
specific clones. For these clones, P5 (Lys 93) was absolutely 
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conserved while P3 (Phe 91) could be substituted by some 
aromatic or aliphatic amino acids. The third motif for 
multiple sclerosis antigens was defined as: 



Pos ition 


PI 


P2 


P3 


P4 


P5 


MS Motif 83 : 


V 


H 


F 


F 


K 




L 


F 


Y 


Y 






I 


Y 


W 


W 






A 


W 


V 


V 






M 




L 
I 


L 
I 





These HLA-DR motifs also matched well with the structural 
requirements for an HLA-DQ1 restricted clone specific for the 
MBP(85-99) peptide. This clone required the same minimal 
peptide segment as DR2 restricted clones (residues 87-97). As 
in the DR2 restricted clones, P2 (His 90), P3 (Phe 91) and P5 
(Lys 93) appeared to be the primary TCR contact residues. 
Substitution of these hydrophobic positions by aspartic acid 
greatly diminished the stimulatory capacity of the peptide 
while substitutions by other hydrophobic amino acids were 
tolerated. These data suggest that the MBP(85-99) peptide is 
bound in a similar fashion to HLA-DR2b and to HLA-DQ1 and that 
the same peptide residues are critical for interaction with 
the TCR. 

These motifs were used as search criteria in a search of 
protein databases (PIR and SwissProt) using the Genetics 
Computer Group software (program: findpat terns ) . More than 
600 sequences of viral and bacterial origin were identified 
that matched these criteria. From this set, sequencers were 
selected based on the following criteria: (1) Viruses known to 
cause human pathology, (2) Viruses prevalent in the Northern 
Hemisphere where MS occurs most frequently, (3) Selected 
bacterial sequences associated with inflammatory CNS disease 
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(such as Borrelia burgdorferi ) and with invasive infections^ 
(such as Staphylococcus aureus , Klebsiella pneumoniae and 
Pseudomonas aeruginosa ) . Not included were most viruses that 
cause infections in tropical countries, sequences derived from 
vaccinia virus and a large number of sequences from E. coli 
(which is part of the normal intestinal flora). When multiple 
antigenic variants were present, one or several sequences that 
best fit the motifs were chosen. The selected peptides were 
synthesized by Pin-Technology on a 1 mg scale (Chiron 
Mimotopes, San Diego). Seventy peptides were made which fit 
motifs ttl and 82 and 59 peptides for motif 83. 

These peptides were then tested for their ability to 
activate human MBP(85-99) specific T cell clones that had been 
previously established from blood T cells of two patients with 
relapsing-remitting MS ( Wucherpf ennig et al., 1994a; 
Wucherpfennig et al., 1994b), Homozygous B cell lines that 
expressed DR2 (DRB1*1501 or DRB1*1602) or DQ1 were used as 
antigen presenting cells (APCs) in these T cell proliferation 
experiments (Wucherpfennig et al., 1994a). As a positive 
control, all clones were shown to be activated by 'the 
MBP(85-99) peptide. Seven clones were tested with the 
viral/bacterial peptides selected according to the sequence 
motifs - 

Three of the seven clones tested were efficiently 
activated by several viral/bacterial peptides. The first 
clone (Hy.lBll) was HLA-DQ1 restricted whereas the other two 
clones (Hy.2Ell and Hy.lGll) were HLA-DR2 restricted _ 
(Wucherpfennig et al., 1994a). Among the 70 peptides .selected 
according to motifs 81 and 82, three mimicry peptides 
stimulated the DQ1 restricted clone and two peptides 
stimulated both of the DR2 restricted T cell clones. Among 
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the group of 59 peptides selected according to motif tt3, one 
peptide was identified for the DQ1 restricted clone while two 
were identified for the DR2 restricted clones. 

Taken together, the DQ1 restricted T cell clone recognized 
five structurally related peptides: the immunodominant 
MBP(85-99) peptide, three viral peptides (from Herpes simplex, 
Adenovirus type 12 and Human Papillomavirus) and a bacterial 
peptide ( Pseudomonas aeruginosa ). Two of the DR2 restricted 
clones were activated by four peptides. Both clones 
recognized the MBP( 85-99) peptide as well as peptides from EBV 
and Influenza virus. In addition, one clone recognized a 
viral peptide from Reovirus (clone Hy.2Ell) and one recognized 
a peptide from Herpes simplex virus (clone Hy.lGll). These 
results, and the sequences of these peptides, are summarized 
below in Table 2. 

TABLE 2 

Peptides Recognized by Clone Hy.lBll (DQ1 Restricted): 
MBP < 8 5 - 9 9 ) ENP WHFFKN I VTPR 

Herpes Simplex, UL15 Protein FRQLVHFVRDFAQLL 
Adenovirus Type 12, ORF DFEWTFLKDVLPEF 
Pseudomonas aeruginosa , Phosphomannomutase DRLLMLFAKDVVSRN 
Human Papillomavirus Type 7, L2 Protein I GGRVHFFKD I S P I A 

Peptides Recognized by Clone Hy.2Ell (DR2 Restricted): 

MBP ( 8 5 -9 9 ) ENP WHFFKN I VTPR 

Epstein-Barr Virus, DNA Polymerase TGGVYHFVKKHVHES 

Influenza Type A, Hemagglutinin YRNL VWF I KKNTRYP 

Reovirus Type 3, Sigma 2 Protein MARAAFLFKTVGFGG 
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Peptides Recognized by Clone Hy.lGll (DR2 Restricted): 

MBP( 85-99) ENP WHFFKN I VTPR 

Epstein-Bar r Virus, DNA polymerase TGGVYHFVKKHVHES 

Influenza Type A, Hemagglutinin YRNLVWF I KKNTRYP 

Herpes Simplex, DNA Polymerase GGRRLFFVKAHVRES 

For these viral/bacterial peptides to be involved in the 
initiation of autoimmunity, they have to be capable of potent 
T cell stimulation that results in marked clonal expansion of 
autoaggressive T cell clones. The stimulatory capacity -of 
each of these peptides was therefore compared to the 
MBP( 85-99) peptide in a titration experiment. The peptides 
were found to be efficient stimulators of the MBP specific T 
cell clones; in particular the EBV peptide (DR2 restricted 
clones) and the adenovirus peptide (DQ1 restricted clone) were 
similar to the MBP(85-99) peptide in their stimulatory 
capacity. These results demonstrate that the T cell 
activation is not the result of a minor degree of 
"cross-reactivity' but rather the result of structural 
similarity sufficient for potent T cell activation. 

Comparison of the peptide sequences that were found to 
stimulate the same TCR revealed several interesting points: 

(1) Only one peptide (Human Papillomavirus L2 protein) had 
striking sequence similarity with the MBP(85-99) peptide in 
that all amino acids in the MBP(89-95) segment except position 
94 (Asn to Asp) were identical (Table 2). For all other 
sequences, simple alignment would not have predicted them to 
be efficient stimulators of MBP(85-99) specific T celi' 
clones. Therefore, absent the method of the present 
invention, these peptides would not have been identified. 
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(2) At positions not specified by the search criteria, the 
selection for particular amino acids was still apparent (Table 
2). For the DQ1 restricted clone, for example, aspartic acid 
was selected at position P6 (residue 94 of MBP), a probable 
TCR contact residue, in all four peptides. This position is 
occupied by asparagine in the MBP peptide (similar size, but 
no negative charge) . Substitution of Asn 94 for Asp in the 
MBP peptide markedly increased its stimulatory capacity for 
the DQ1 restricted clone but reduced it for the DR2 restricted 
clone. Selection also occurred at the neighboring P7 MHC 
contact (He 95) for which He, Val or Phe were selected (all 
hydrophobic ) . 

(3) Different selection events occurred for the DQ1 and 
the DR2 restricted clones: At position P6 (94 of MBP), there 
was selection of aspartic acid (negative charge) for DQl 
peptides whereas there was selection for lysine (positive 
charge) in two of the three peptides presented by DR2 . 

(4) In the flanking segments (residues 85-87 and 97-99), 
no apparent selection took place as amino acids with different 
size and charge were allowed. 

The majority of the viruses from which these sequence 
peptides were identified are common human pathogens: Influenza 
Type A frequently causes respiratory tract infections; Human 
Papillomavirus infects epithelial tissues and has been linked 
to cervical carcinomas; and Epstein-Barr Virus (EBV) causes an 
acute viral syndrome (infectious mononucleosis) in young 
adults. Human Herpesvirus I (Herpes simplex), EBV and Human 
Papillomavirus cause latent or persistent infections with 
neurons (Herpes simplex), B cells (EBV) and epithelial cells 
(Papillomavirus) serving as the reservoirs. Viral expression 
can be reactivated by UV exposure and stress (Herpes simplex) 
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and by B cell activation (EBV) (Schwarz et al, 1985; Epstein 
et al., 1977; Spruance 1985; Tovey et al., 1978). For the 
induction and maintenance of an autoimmune response these 
persistent viral infections are of particular interest as they 
could explain the chronicity of the clinical disease and the 
clonal expansion and persistence of MBP specific T cells. 
Reactivation of viral expression may also be involved in 
triggering clinical relapses. By this mechanism, viral 
peptides could activate resting HBP specific T cells in 
periphery and allow them to invade the CNS. 

Are these foreign epitopes actually presented to 
autoreactive T cells during a viral infection? The peptide 
from the EBV DNA polymerase allowed this question to be 
addressed. In EBV transformed B cells (which were used as 
antigen presenting cells in the T cell assays) the lytic viral 
cycle is repressed. The DNA polymerase gene is not transcribed 
in this latent state; however, B cell activation results in 
activation of the lytic cycle and in the expression of the DNA 
polymerase gene (Datta et al., 1980). To examine MHC class II 
restricted presentation of the EBV DNA polymerase; an 
HLA-DR2* EBV transformed B cell line (MGAR) and an MHC 
mismatched control (9001, HLA-DR1) were pretreated for 36 
hours with phorbol ester which was removed by extensive 
washing prior to coculture of APC with T cells. T cell clones 
Hy.2Ell and Hy.lGll, which recognize the EBV DNA polymerase 
peptide presented by HLA-DR2, were activated by a HLA-DR2 + 
EBV transformed B cell line pretreated with phorbol ester. 
This effect was specific because MHC mismatched B cells did 
not activate the clones; also, a control clone (Ob.lAl'2) that 
recognized MBP( 85-99) but not the EBV peptide was not 
activated. In a separate experiment, T cell activation was 
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blocked by a mAb specific for HLA-DR (mAb L243 ) but not by a 
mAb specific for HLA-DQ (G2a.5). These results demonstrate 
that the MBP specific T cell clones recognize not only the 
viral peptide but also antigen presenting cells infected with 
the virus. In vivo this recognition event could lead to 
chronic antigenic stimulation of MBP specific T cells as B 
cell activation results in the expression of EBV genes, 
including the DNA polymerase gene. 

Finally, the presentation of the viral peptides by 
different DR2 subtypes was compared to determine if they are 
efficiently presented by the disease associated molecule 
(DRB1*1501, the most common DR2 subtype). The MBP peptide was 
presented by three of the four DR2 subtypes; the peptide was 
not presented by DRB1*1601 which differs from DRB1*1602 by a 
single amino acid substitution (at position DR867, a possible 
TCR contact). The two viral peptides were presented much 
better by the DR15 molecules (DRB1*1501 and 1502 which differ 
only at position DR886) than by DRB1*1602. This was 
particularly evident for the influenza peptide which only 
activated the T cell clone when presented by the 
DRB1*1501/1502 molecules but not by DRB1*1602. These results 
indicate that the viral peptides identified herein are _ 
preferentially presented by the MS associated DR2 molecule 
(DRB1*1501) . 

DEFINITIONS 

For clarity of interpretation and to clearly and 
distinctly point out the subject matter of the claimed 
invention, the following definitions are provided for several 
terms used in the claims appended hereto. 
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Th e term "sequence motif," in accordance with description 
provided herein, means a series of restrictions on the 
residues which may occupy certain relative positions of an 
amino acid sequence. A sequence motif must restrict at least 
three and preferably four or five positions of an amino acid 
sequence. The relative positions of the first (N-terminal) 
and last (C-terminal) restricted amino acid positions shall be 
separated by at least two but no more than twelve amino acid 
residues. For example, PI and P4 may be the first and last 
restricted residues and these residues are separated by- two 
residues. As another example, P-l and Pll may be the first 
and last restricted residues and these are separated by ten 
residues. Positions between the first and last restricted 
positions may be restricted or unrestricted with the exception 
that a total of at least three positions of the motif must be 
restricted. Of the three positions which must be restricted, 
at least two must be residues corresponding to major MHC 
binding pockets. If only two of the restricted residues 
correspond to WHC binding residues, the third must correspond 
to a TCR contact residue. Further, at least one of the 
positions restricted must correspond to either the PI or P4 
binding position. By 11 restricted" is meant that at least one, 
and preferably ten, amino acid residues shall be excluded from 
a position. 

An amino acid sequence "corresponds" to a sequence motif 
if it can be aligned with the positions of the sequence motif 
such that, at each restricted position of the motif, the acnino 
acid sequence includes a residue which is not excluded from 
that position by the restrictions which define the motif. As 
the restrictions which define the motif are derived from the 
size and nature of the MHC binding pockets of HLA-DR proteins 
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and, optionally, the TCR contact residues of known epitopes, 
the restricted positions of the binding motif may also be said 
to correspond to the MHC binding pockets and TCR contact 
residues . 

The term "isolated," in reference to a protein or 
polypeptide, means separated from its native or natural 
chemical microenvironment . Thus, a polypeptide isolated from 
a bacterium shall be in a preparation substantially free of 
most of the other bacterial polypeptides and, similarly, an 
isolated viral polypeptide preparation shall be substantially 
free of the other polypeptides which comprise the virus. 

The term "associated with," as used in connection with a 
particular HLA-DR protein and an autoimmune disease or 
autoimmune response, shall mean that the protein and the 
disease/response have been positively correlated by clinical 
or epidemiological studies such that the likelihood of 
developing the disease/response has been shown to be increased 
by presence of the protein. 

The term "HLA-DR protein" means the particular protein 
product of a particular allele of the MHC class II HLA-DR 
gene. A disease associated with an HLA-DR protein is one 
associated with such a particular protein and not merely^ with 
the HLA-DR gene locus. 

By the term "human pathogen" is meant a bacterium, virus 
or protozoon capable of infecting humans and generating an 
immune response. The term specifically is intended to exclude 
the bacteria which form part of the normal human intestinal 
flora. By this term, "normal human intestinal flora,"" is 
meant the bacteria which normally inhabit the human gut, such 
as Escherichia coli, but which do not normally cause disease. 
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By the term "autoreactive/' as applied to T cells, is 
meant T cells from a human which are activated by a human self 
epitope. By "activation" of T cells is meant induction to 
proliferate, secrete lymphokines (cytokines) and/or to 
initiate effector activity (e.g. cytotoxicity). 

By the term "autoant igen" is meant a self protein or 
polypeptide which includes a "self epitope." By "self 
epitope" is meant that part of an autoantigen which is 
recognized by T cells when bound to and presented by an MHC 
molecule. 

By the term "effective amount," with respect to tolerizing 
an individual to an antigen, is meant an amount of the antigen 
sufficient to render T cells, otherwise specific for the 
antigen, unresponsive to the antigen when bound to and 
presented by an MHC molecule. T cells which are unresponsive 
fail to activate when presented with the antigen for which 
they are specific. By the term "effective amount," with 
respect to immunizing an individual to an antigen, is meant an 
amount sufficient to induce an immune response which results 
in T cells specific for the antigen. Typical ranges of 
dosages are from 1 nanogram/kilogram to 100 
milligrams/kilogram or even 500 milligrams/kilogram. 
Effective amounts will vary according to such factors as age , 
sex and sensitivity to the antigen. 

By the term "core MHC binding residues" is meant the 
residues of an epitope corresponding to the P-l to P9 
positions of a peptide bound to an HLA-DR molecule. By the t 
term "inner core MHC binding residues" is meant those residues 
of an epitope corresponding to the P-l to P6 positions of a 
peptide bound to an HLA-DR molecule. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: STROMINGER , JACK L. 

WUCHERPFENNIG, KAI 

(ii) TITLE OF INVENTION: IDENTIFICATION OF SELF AND NON-SELF 
ANTIGENS IMPLICATED IN AUTOIMMUNE DISEASE 

(iii) NUMBER OF SEQUENCES: 16 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: WOLF, GREENFIELD & SACKS, P.C. 

(B) STREET: 600 ATLANTIC AVENUE 

(C) CITY: BOSTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02210 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release 81.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: TWOMEY, MICHAEL J. 

(B) REGISTRATION NUMBER: 38,349 

(C) REFERENCE /DOCKET NUMBER: HO 4 98/70 15 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ala Thr Gin Lys He Thr Tyr Arg He Ser Gly Val Gly He Asp 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Phe Gly He Phe Val Val Asp Lys Asn Thr Gly Asp He Asn He 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Leu Asn Ser Lys He Ala Phe Lys He Val Ser Gin Glu Pro Ala 
1 5 10 _ 15 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Thr Pro Met Phe Leu Leu Ser Arg Asn Thr Gly Glu Val Arg Thr 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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( i i i ) HYPOTHETI CAL : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Cvs Glu Cys Asn He Lys Val Lys Asp Val Asn Asp Asn Phe Pro 
1 5 1° 15 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

( i i i ) HYPOTHET I CAL : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Ala Arg Thr Leu Asn Asn Arg Tyr Thr Gly Pro Tyr T*ir Phe 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Gin Ser Gly Thr Met Arg Thr Arg His Ser Thr Gly Gly Thr Asn 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HERPES SIMPLEX 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Phe Arq Gin Leu Val His Phe Val Arg Asp Phe Ala Gin Leu Leu 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM : ADENOVIRUS TYPE 12 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Asp Phe Glu Val Val Thr Phe Leu Lys Asp Val Leu Pro Glu Phe 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PSEUDOMONAS AERUGINOSA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Arq Leu Leu Met Leu Phe Ala Lys Asp Val Val.Ser Arg Asn 
1 5 -10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : HUMAN PAPILLOMAVIRUS TYPE 7 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 

He Gly Gly Arg Val His Phe Phe Lys Asp He Ser Pro He Ala 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EPSTE I N-BARR VIRUS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Thr Gly Gly Val Tyr His Phe Val Lys Lys His Val His Glu Ser 
15 10 I 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : INFLUENZA VIRUS TYPE A 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tyr Arg Asn Leu Val Trp Phe He Lys Lys Asn Thr Arg Tyr Pro 
1^5 10 15 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : REOVIRUS TYPE 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ala Arg Ala Ala Phe Leu Phe Lys Thr Val Gly Phe Giy Giy 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HERPES SIMPLEX VIRUS 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His Val Arg Glu Set 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 6 : 
Glu Asn Pro Val Val His Phe Phe Lys Asn lie Val Thr Pro Arg 



1 



5 



10 



15 
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CLAIMS 

We claim: 

1. An isolated polypeptide consisting essentially of an 
amino acid sequence selected from the group consisting of SEQ 
ID NO.: 1, SEQ ID NO. : 2, SEQ ID NO. : 3, SEQ ID NO. : 4,_SEQ ID 
NO.: 5, SEQ ID NO. : 6, and SEQ ID NO.: 7. 

2. An isolated polypeptide consisting essentially of an 
amino acid sequence selected from the group consisting of SEQ 
ID NO.: 8, SEQ ID NO.: 9, SEQ ID NO.: 10, SEQ ID NO.: 11, SEQ 
ID NO.: 12, SEQ ID NO.: 13, SEQ ID NO.: 14, and SEQ ID NO. : 15. 

3. A pharmaceutical preparation for tolerization comprising 
a pharmaceutical ly acceptable carrier and 

an amount of an isolated human polypeptide effective for 
tolerizing an individual to an autoantigen, said human 
polypeptide including an amino acid sequence corresponding to 
a sequence motif for an HLA-DR protein; 

wherein said HLA-DR protein is associated with a human 
autoimmune disease; 

wherein said polypeptide binds to said HLA-DR protein; 

wherein said polypeptide bound to said HLA-DR protein 
activates autoreactive T cells from a subject having said 
autoimmune disease; and 

wherein said polypeptide is a non-collagen and 
non-myelin basic protein polypeptide. 
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4. The pharmaceutical preparation of claim 3 wherein said 
HLA-DR protein is an HLA-DR4 protein and said autoimmune 
disease is pemphigus vulgaris. 

5. The pharmaceutical preparation of claim 4 wherein said 
motif is PV motif Ml. 

6. The pharmaceutical preparation of claim 4 wherein said 
amino acid sequence consists essentially of an amino acid 
sequence selected from the group consisting of SEQ ID NO.: 1, 
SEQ ID NO. : 2, SEQ ID NO. : 3, SEQ ID NO. : 4, SEQ ID NO.: 5, 
SEQ ID NO. : 6, and SEQ ID NO. : 7. 

7. A pharmaceutical preparation for tolerization comprising 
a pharmaceutical ly acceptable carrier and 

an amount of an isolated human pathogen polypeptide 
effective for tolerizing an individual to said polypeptide, 
said polypeptide including an amino acid sequence 
corresponding to a sequence motif for an HLA-DR protein; 

wherein said HLA-DR protein is associated with a human 
autoimmune disease; 

wherein said polypeptide binds to said HLA-DR protein; 

and 

wherein said polypeptide bound to said HLA-DR protein 
activates autoreactive T cells from a subject having said 
autoimmune disease. 

8. The pharmaceutical preparation of claim 7 wherein said 
HLA-DR protein is an HLA-DR2 protein and said autoimmune 
disease is multiple sclerosis. 
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9. The pharmaceutical preparation of claim 8 wherein ^said 
motif is selected from the group consisting of MS motif ttl, MS 
motif «2 and MS motif 83. 

10. The pharmaceutical preparation of claim 8 wherein said 
amino acid sequence consists essentially of an amino acid 
sequence selected from the group consisting of SEQ ID NO. : 8, 
SEQ ID NO.: 9, SEQ ID NO.: 10, SEQ ID NO.: 11, SEQ ID NO. : 12, 
SEQ ID NO.: 13 , SEQ ID NO.: 14, and SEQ ID NO.: 15. 

11. A method of tolerizing an individual to an autoantigen 
of pemphigus vulgaris comprising 

administering an effective amount of the pharamaceut ical 
preparation of any one of claims 4-6 to a subject in need of 
such treatment . 

12. A method of tolerizing an individual to a foreign 
antigen implicated in multiple sclerosis comprising 

administering an effective amount of the pharmaceutical 
preparation of any one of claims 8-10 to an individual in need 
of such treatment. 

13. A pharmaceutical preparation for vaccinating an 
individual at risk of an autoimmune disease comprising a 
pharmaceutical ly acceptable carrier and 

an amount of an immunogenic preparation effective to 
immunize against a human pathogen that in its native form t 
includes a polypeptide having an amino acid sequence 
corresponding to a sequence motif for an HLA-DR protein; 

wherein said HLA-DR protein is associated with said 
autoimmune disease; 
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wherein said polypeptide binds to said HLA-DR protein; 

wherein said polypeptide bound to said HLA-DR protein 
activates autoreactive T cells from a subject having said 
autoimmune disease; and 

wherein said preparation is free of a polypeptide 
corresponding to said sequence. 

14. The pharmaceutical preparation of claim 13 wherein said 
HLA-DR protein is an HLA-DR4 protein and said autoimmune 
disease is pemphigus vulgaris. 

15. The pharmaceutical preparation of claim 14 wherein said 
motif is PV motif Ml. 

16. The pharmaceutical preparation of claim 14 wherein said 
amino acid sequence consists essentially of an amino acid 
sequence selected from the group consisting of SEQ ID NO.: 1, 
SEQ ID NO. : 2, SEQ ID NO. : 3, SEQ ID NO.: 4, SEQ ID NO. : 5, 
SEQ ID NO.: 6 , and SEQ ID NO.: 7. 

17. The pharmaceutical preparation of claim 13 wherein said 
HLA-DR protein is an HLA-DR2 protein and said autoimmune^ 
disease is multiple sclerosis. 

18. The pharmaceutical preparation of claim 17 wherein said 
motif is selected from the group consisting of MS motif ttl, MS 
motif « 2 and MS motif H3. 

19. The pharmaceutical preparation of claim 17 wherein said 
amino acid sequence consists essentially of an amino acid 
sequence selected from the group consisting of SEQ ID NO. : 8, 
SEQ ID NO.: 9, SEQ ID NO.: 10, SEQ ID NO.: 11, SEQ ID NO.: 12, 
SEQ ID NO. : 13, SEQ ID NO.: 14, and SEQ ID NO.: 15. 
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20. A method of vaccinating an individual at risk of 
pemphigus vulgaris comprising 

administering an effective amount of the pharmaceutical 

preparation of any one of claims 14-16. 



21. A method of vaccinating an individual at risk of multiple 
sclerosis comprising 

administering an effective amount of the pharmaceutical 
preparation of any one of claims . 17-19 . 

22. The pharmaceutical preparation of claim 17 wherein said 
pathogen and said peptide are selected from the group 
consisting of the respective pairs' Herpes simplex virus and 
UL15 protein, Herpes simplex virus and SEQ ID NO.: 8, 
Adenovirus and Adenovirus ORF protein, Adenovirus and SEQ ID 
NO.: 09, Pseudomonas aeruginosa and phosphomannomutase protein, 
Pseudomonas aeruginosa and SEQ ID NO. : 10, Papillomavirus and 
L2 protein, Papillomavirus and SEQ ID NO. : 11, Epstein-Barr 
virus and DNA polymerase protein, Epstein-Barr virus and SEQ ID 
NO.: 12, Influenza virus and hemagglutinin protein, Influenza 
virus and SEQ ID NO. : 13, Reovirus and sigma 2 protein, 
Reovirus sand SEQ ID NO.: 14, Herpes simplex virus and DNA 
polymerase, and Herpes simplex and SEQ ID NO.: 15. 

23. A method of evaluating a peptide for an ability to induce 
an autoimmune response comprising the steps of 

(1) choosing an MHC HLA-DR molecule associated with said 
autoimmune response, said MHC molecule having major MHC binding 
pockets designated PI, P4 , P6, P7 and P9 , each said pocket 
binding an amino acid residue at a corresponding relative 
position, Px, of an epitope; 
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(2) selecting a first major WHC binding pocket Pi and a 
second major MHC binding pocket P j ; 

(3) identifying a first set of amino acid residues which 
bind within said first pocket, identifying a second set of 
amino acid residues which bind within said second pocket; and 

(4) comparing an amino acid sequence of said peptide to a 
sequence motif wherein said motif includes said first set of 
amino acid residues at a relative position Pi and said second 
set of amino acid residues at a-relative position P j . 

24 . The method of claim 23 

wherein said epitope is an epitope of known amino acid 
sequence and wherein said epitope has TCR contact residues 
designated P-l, P2 , P3 , P5, P8 and Pll; 

wherein step (2) further comprises selecting a first TCR 
contact point at a relative position Pk of said epitope and 
wherein step (3) further comprises identifying a third set of 
amino acid residues which bind a TCR at said contact point, and 

wherein said motif includes said third set of amino acid 
residues at a relative position Pk. 

25. The method of any one of claims 23 and 24 wherein^said 
first binding pocket is a PI pocket and said second binding 
pocket is selected from the group consisting of a P4 pocket and 
a P6 pocket . 
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26. A method of identifying foreign antigens implicated in 
human autoimmune response comprising the steps of 

(1) choosing an MHC HLA-DR molecule associated with said 
autoimmune response, said MHC molecule having major MHC binding 
pockets designated Pi, P4 , P6, P7 and P9 , each said pocket 
binding an amino acid residue at a relative position Px of an 
epitope; 

(2) selecting a first major MHC binding pocket Pi and a 
second major MHC binding pocket P j ; 

(3) identifying a first set of amino acid residues which 
bind within said first pocket, identifying a second set of 
amino acid residues which bind within said second pocket; 

(4) defining a sequence motif wherein said motif includes 
said first set of amino acid residues at a relative position Pi 
and said second set of amino acid residues at a relative 
position Pj ; 

(5) identifying a set of human pathogen peptide sequences 
corresponding to said motif. 

27. The method of claim 26 further comprising the step of 
excluding from said set sequences from at least one species in 
a normal human intestinal flora, 

28. The method of claim 26 further comprising the step of 
excluding from said set sequences from at least one species of 
pathogen negatively correlated with the incidence of said 
response . 

29. The method of claim 26 wherein step (5) includes* a search 
of a computer database using said motif as a search criterion. 
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ABSTRACT 



The present invention provides isolated peptides relating 
to the autoimmune diseases pemphigus vulgaris and multiple 
sclerosis- The peptides relating to pemphigus vulgaris- are 
self epitopes and those relating to multiple sclerosis are 
foreign antigens derived from human pathogens which are 
implicated in the aetiology and remissions of the disease. 
Pharmaceutical preparations for tolerizing and/or immunizing 
individuals are provided as well as methods relating thereto. 
Methods are provided for identifying other self and non-self 
epitopes involved in human autoimmune disease and similar 
pharmaceutical preparations and methods of use for these 
epitopes are also provided. 
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IDENTIFICATION OF SELF AND NON-SELF one lhai carries a negative charge al DRp 71; a positive 

ANTIGENS IMPLICATED AUTOIMMUNE charge (Arg) is found at DRP 71 io the RA associated DR4 

DISEASE molecules. 

With respect to multiple sclerosis, recent immunological 

This invention was supported in part by grant CA47554 5 studies suggest that myelin basic protein (MBP) may be one 

from the National Institutes of Health. The govermenl has of the important target antigens in the immu no pat ho gene sis 

certain right in the invention. of l ^ d *f **, Scvcr f 1 st H dlcs hav * ^"f 31 ^ * at M ® P 

specific T cells are clonally expanded in MS patients and in 

FIELD OF THE INVENTION an in vivo activated state (Allegretta et al., 1990; 

30 Wuchcrpfcnnig et al, 1994b; Zhang ct al., 1994). Reactivity 

The present invention relates to the field of immunology w ith the immunodominant MBP(84-102) peptide is found 

and, in particular, to the identification of self and non-self predominantly in subjects carrying HLA-DR2 (the most 

antigens implicated in human autoimmune responses. The common subtype of which is DRB1 * 1501), a genetic marker 

invention relates to methods of identifying such self and for susceptibility to MS. The MBP(84-102) epitope can also 

non-self antigens and provides examples of such antigens t> c presented by other MHC class II antigens, including 

relating to multiple sclerosis and pemphigus vulgaris. The 3 HLA-DQl (Ota et al., 1990; Martin et al., 1990; Pcltc et al., 

invention also relates to the use of such antigens for in vitro 1990- Wucherpfennig et al. t 1994a). In vivo, the T cell 

assays, animal models, therapeutic agents and vaccines. response to this peptide appears to be dominated by a few 



BACKGROUND OF THE INVENTION 



expanded clones. 
20 While associations between MHC alleles and disease 
Human autoimmune diseases have a striking genetic states have implicated autoimmunity in the aetiology of 
association with particular alleles of major histocompatabil- these diseases, a large body of clinical and epidemiological 
ity complex ("MHC) class I or class II genes. The field was evidence suggests that infections may be important in the 
established by the seminal discovery of HLA-B27 linked induction of autoimmunity. For example, particular viral 
susceptibility to ankylosing spondylitis, a chronic inflam- 25 infections frequently precede autoimmune myocarditis and 
matory joint disease (Brewerton (et al., 1973; Schlosslein et type I diabetes (IDDM) (Rose et al., 1986; Ray et al., 1980). 
al., 1973). MHC associated susceptibility has now been Environmental agents also influence the risk of developing 
documented for a variety of human autoimmune diseases, multiple sclerosis as demoastrated by migration studies, 
including insulin dependent diabetes mellitus (IDDM), rheu- Individuals that migrate after age 15 carry the risk for 
matoid arthritis (RA), pemphigus vulgaris (PV), multiple 30 developing MS associated with their geographic origin 
sclerosis (MS) and myasthenia gravis (MG), just to name a while individuals who migrate earlier in life acquire the risk 
few (Todd et al., 1987; Ahmed et al., 1990; Ahmed et al. of the geographical region to which they migrated (Kurtzke, 
1991; Lanchbury & Panayi, 1991; Spielman & Nathenson, 1985). These studies are consistent with the hypothesis that 
1982; Protti et al., 1993). a group of pathogens that are relatively ubiquitous in a 
The MHC locus most commonly associated with autoim- 35 certain geographic region influence the risk of developing 
mune disease is the HLA-DRB locus (also known as DRBl), multiple sclerosis (MS). The raechanism(s) leading to clonal 
a highly polymorphic locus with over fifty known alleles. expansion of MBP reactive T cells remain to be identified 
For example, a large body of epidemiological work has but could involve recognition of viral peptides with suffi- 
documented the association of rheumatoid arthritis with the cient structural similarity to the immunodominant MBP 
DR4 (DRB1*0401, DRB1*0404) and DR1 (DRB1*0101) 40 peptide. The initiation of autoimmunity by such a mecha- 
alleles, with the DR4 alleles conferring a higher risk than nism could then lead to sensitization to other CNS self 
DR1 (Lanchbury & Panayi, 1991), The risk is dramatically antigens by determinant spreading (Lehmann et al., 1992; 
increased when the subject is homozygous or heterozygous Kaufman et al. 1993; Tisch et al., 1993). Consonant with this 
for DRB1*0401 and/or DRB1*0404. The observation that hypothesis, it has been noted that inflammatory CNS disease 
arthritis is associated with three DR alleles that are struc- 45 can follow infection with a number of common viral 
turally similar led to the development of the 'shared epitope' pathogens, such as measles and rubella. On the other hand, 
hypothesis as DRB1*0401, 0404 and 0101 share critical the absence of virus in the CNS of these patients and 
polymorphic residues in the DR0 67-71 cluster (Gregersen reactivity to myelin basic protein in these patients suggest an 
el al. 1987; Lanchbury & Panayi, 1991). These residues (in autoimmune mechanism (Johnson et al., 1984). 
particular DRP 71) appear to be critical in defining the 50 Efforts to identify sequence homologies between self 
selectivity of peptide binding to the disease associated peptide epitopes that might be involved in autoimmunity and 
molecules. various bacterial and viral pathogens have therefore been 
Pemphigus vulgaris is an autoimmune disease of the skin made. These homology searches have focused on align- 
in which high titer autoantibody production to an epidermal ments with sequence identity. No success has been reported 
cell adhesion molecule (desmoglein 3) results in a loss of 55 using such alignments in identifying epitopes from patho- 
keratinocyte adhesion (acantholysis) and severe blister for- gens that could cross react with presumably pathogenic T 
mation (Amagai et al., 1991). Indifferent ethnic groups the cell lines from human patients with autoimmune disease 
disease is associated either with a DR4 allele (DRB1*0402) (Oldstone. 1990). A sequence identity was recently found 
or with a rare DQ1 allele (DQB 1*05032); only a small between an epitope in a Coxsackie virus protein and 
fraction of PV patients have neither susceptibility gene 60 GAD65, suspected of being an autoantigen in diabetes. 
(Ahmed el al., 1 99 1 ; Ahmed et al., 1 990; Scharf et al., 1 988). These peptides could reciprocally generate polyclonal T cell 
The DR4 subtype associated with pemphigus differs only at fines from mice that cross react with the other peptides 
three positions in the DRp 67-71 cluster from the DR4 (Tian,etal., 1994). No evidence, however, was provided that 
subtype associated with RA. The PV associated molecule these peptides could stimulate clones from diabetic mice (or 
has a negative charge (Glu) at the critical position (DRP 71); 65 humans). 

the neighboring position (DRP 70) is also negatively Recent developments in the field, in particular the iden- 

charged. The DR4 subtype associated with PV Is the only tification of allele specific peptide binding motifs have 
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transformed the field (Madden et al., 1991; Rolschke & Falk, 
1991). Based on this knowledge the structural basis for 
MHC linked susceptibility to autoimmune diseases can be 
reassessed at a level of detaiJ sufficient for solving long- 
standing questions in the field. Motifs for peptide binding to 5 
several MHC class I and class II molecules have been 
defined by sequence analysis of naturally processed peptides 
and by mutational analysis of known epitopes. MHC class I 
bound peptides were found to be short (generally 8-10 
amino acids long) and to possess two dominant MHC anchor 1Q 
residues; MHC class II bound peptides were found to be 
longer and more heterogeneous in size (Madden el al., 1991; 
Rotschke & Falk, 1991; Jardetzky et al. 1991, Chicz et al. 

1993) . Due to the size heterogeneity, however, it has proven 
more difficult to define MHC class II binding motifs based 
on sequence alignments. More recently, a crystal structure 3 
for HLA-DR1 demonstrated that there is a dominant hydro- 
phobic anchor residue close 10 the N-terminus of the peptide 
and that secondary anchor residues are found at several other 
peptide positions (Brown et al., 1993). Even this work, 
however, could not provide a detailed description of the 20 
binding pockets of HLA-DR proteins, the particular residues 
involved in the formation of these pockets of the structural 
requirements or antigens for MHC binding. 

In the present disclosure, a detailed description of the 
HLA-DR antigen binding pockets is provided (Stern ct al, 25 

1994) . With this information, together with functional infor- 
mation defining those amino acids of the self or non-self 
antigen that arc needed for MHC binding and TCR contact 
(e.g., Wucherpfennig el al. 1994a,), binding motifs for the 
various HLA-DR allotypes may be developed, self epitopes 30 
involved in autoimmune disease may be identified and a 
method is provided for identifying bacterial and viral 
epitopes which may initiate a human autoimmune response. 

SUMMARY OF THE INVENTION 
The present invention provides, in one aspect, seven 
different isolated polypeptides derived from the human 
desmoglein 3 protein and implicated as self epitopes in the 
autoimmune disease pemphigus vulgaris (PV). These pep- 
tides consist essentially of the seven amino acid sequences 40 
disclosed herein and designated SEQ ID NO.: 1 through and 
SEQ ID NO.: 7. In particular, the invention provides isolated 
peptides which consist of these sequences, the core MHC 
binding residues of these sequences, or the inner core MHC 
binding residues of these sequences. 45 

The present invention provides, in another aspect, eight 
different isolated polypeptides derived from human patho- 
gens but implicated in the aetiology of the autoimmune 
disease multiple sclerosis. These peptides consist essentially 
of the eight amino acid sequences disclosed. here in and 50 
designated SEQ ID NO.: 8 through SEQ ID NO.: 15. In 
particular, the invention provides isolated peptides which 
consist of these sequences, the core MHC binding residues 
of these sequences, or the inner core MHC binding residues 
of these sequences. 55 

In another set of embodiments, the invention provides for 
pharmaceutical preparations for use in tolerizing individuals 
to autoantigens. The preparations include a pharmaceuti- 
cally acceptable carrier and an isolated human polypeptide 
which includes an amino acid sequence corresponding to a 60 
sequence motif for an HLA-DR protein which is associated 
with a human autoimmune disease. These polypeptides are 
capable of binding to the HLA-DR protein to form a 
complex which activates autoreactive T cells in subjects 
having the autoimmune disease. The peptides arc not 65 
derived from human collagen or human myelin basic pro- 
tein. 
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In particular embodiments, such pharmaceutical prepara- 
tions arc provided in which the HLA-DR protein is HLA- 
DR4 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, a particular sequence motif is provided 
for pemphigus vulgaris and pharmaceuticals having peptides 
with this motif are provided. Specific embodiments of the 
pharmaceuticals include each of the polypeptides described 
above with respect to pemphigus vulgaris. Thus, methods of 
tolerizing an individual to a pemphigus vulgaris autoantigen 
are also provided. 

In another set of embodiments, the invention provides for 
pharmaceutical preparations for use in tolerizing individuals 
to antigens of human pathogens which are implicated in 
human autoimmune disease. The preparations include a 
pharmaceutical^ acceptable carrier and an isolated human 
pathogen polypeptide which includes an amino acid 
sequence corresponding to a sequence motif for an HLA-DR 
protein which is associated with a human autoimmune 
disease. These polypeptides are capable of binding to the 
HLA-DR protein to form a complex which activates autore- 
active T cells in subjects having the autoimmune disease. 

In particular embodiments, such pharmaceutical prepara- 
tions are provided in which the HLA-DR protein is HLA- 
DR2 protein and the autoimmune disease is multiple scle- 
rosis. In addition, three particular sequence motifs are 
provided for multiple sclerosis and pharmaceuticals having 
peptides with at least one of these motifs are provided. 
Specific embodiments of the pharmaceuticals include each 
of the polypeptides described above with respect to multiple 
sclerosis. Thus, methods of tolerizing an individual to a 
multiple sclerosis foreign antigen are also provided. 

In another aspect of the invention, pharmaceuticals are 
provided for vaccination against a human pathogen impli- 
cated in the aetiology of autoimmune disease. These phar- 
maceutical preparations include a pharmaceutically accept- 
able carrier and an immunogenic preparation effective to 
immunize against a human pathogen. The human pathogen 
is one which in its native form includes a polypeptide having 
an amino acid sequence corresponding to a sequence motif 
for an HLA-DR protein which is associated with the autoim- 
mune disease. These polypeptides are capable of binding to 
the HLA-DR protein to form a complex which activates T 
cells which become autoreactive and inliate the autoimmune 
disease. The preparations of the present invention specifi- 
cally do not include such polypeptides but, rather, include 
other antigens from the pathogen. 

In particular embodiments, such pharmaceutical prepara- 
tions are provided in which the HLA-DR protein is HLA- 
DR4 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, a particular sequence motif is provided 
for pemphigus vulgaris and pharmaceuticals which lack 
peptides having this motif are provided. Specific embodi- 
ments of the pharmaceuticals include preparations lacking 
each of the polypeptides described above with respect to 
pemphigus vulgaris. Thus, methods of immunizing an indi- 
vidual against pathogens which may cause pemphigus vul- 
garis are also provided. 

Similarly, pharmaceutical preparations are provided in 
which the HLA-DR protein is HLA-DR2 protein and the 
autoimmune disease is multiple sclerosis. Three particular 
sequence motifs are provided for multiple sclerosis and 
pharmaceuticals which lack peptides having any of these 
motifs are provided. Specific embodiments of the pharma- 
ceuticals include preparations lacking each of the polypep- 
tides described above with respect to multiple sclerosis. 
Thus, methods of immunizing an individual against patho- 
gens which may cause multiple sclerosis are also provided. 
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llie pharmaceutical preparations for immunizing against CPK models and HLA-DR side chains in contact with the 

pathogens that may cause multiple sclerosis may particularly peptide arc indicated as stick models. Pockets arc viewed in 

include inactivated forms of the pathogen in which the the plane of the peptide binding site, towards the N terminus 

polypeptide corresponding to the motif are removed or of the peptide (PI and P6), towards the C terminus of the 

recombinantly altered. Thus, the invention provides such 5 peptide (P4), towards the p, helical region (P7) or towards 

vaccines in which the pathogen and polypeptide are, the a x helical region (P9). A full color version of this figure 

respectively, Herpes simplex virus and UL15 protein, Her- may be found in Stern et al., 1994. 

pes simplex virus and SEQ ID NO.: 8, Adenovirals and DETAILED DESCRIPTION OF THE 

Adenovirus ORF protein, Adenovirus and SEQ ID NO.: 09, INVENTION 

Pseudomonas aeruginosa and phosphomannomulase 10 . . . 

n , * . enn rr\ Ktr\ . in The present invention relates to a method of identifying 

protein, Pseudomonas aeruginosa and SEQ ID NO.: 10, , , - -j r .u • u-i-. . ■ j - 

n mi j in . • n ii ccn in and evalua tine peptides for their ability to induce an au toim- 

Papillomavirus and L2 protein, Papillomavirus and SEQ ID 6rK , , i 

NO, 11, Epslein-Barr virus ami DNA polymerase protein, mune , res P° nse or i . ,0 ™ Xo ™™ «°f d.sease. In 

Epstein-Barr virus and SEQ ID NO.: 12, Influenza virus and particular the invention relates to methods of (1) evaluating 

, - .-in .^ccnFnMA.u ,c self peptides for their potential involvement in autoimmune 

hemagglutinin protein, Influenza virus and SEQ ID NO.: 13, 15 * * . If , , 

„ . • . „ ' . n „ icen mun. disease when the self epitope or autoantigen is unknown and 

Reovirus and sigma 2 protein, Reovirus sand SEQ ID NO.: . c • \-a r .u • ui - i 

, A IT * . , nKU , * „ ^ (2) evaluating foreign peptides for their possible uwolve- 

14, Herpes simplex virus and DNA polymerase, and Herpes v ' . . • , /\ * ™ • 

sim lex and SEO ID NO * 15 ment in the aetiology of autoimmune disease. The invention 

" i . j r also relates to speciGc peptides identified by the methods of 

The present invention also provides general methods for ^ iQVention and rC p rcS enting self and foreign antigens 

evaluating a peptide for an ability to induce an autoimmune 20 . . respectively, pemphigus vulgaris and multiple 

response. These methods involve choosing an MHC HLA- sclerosis 

DR molecule associated with the autoimmune response, __ ' . . 4 . , . t c ., 

, . WTI « .... t . * «u The methods rely upon the development of ammo acid 

selecting at least two major MHC binding pockets of the . r 4 ' . f , . 4 . . f- f 

77. * Tin . 1 .-r • . <■ • . « sequence motifs to which potential self or foreign epitopes 

HLA-DR molecule, identifying sets of amino acid residues ^ , , r . m L:c A ^^:u^ „ „. ~e ;™ 

r , , i * . » , may be compared. Each motif describes a finite set 01 ammo 

which bind within each of the selected pockets, developing 23 .* * ... # . „ . . «. „, ^ „. , - ,. t .„^ 

■c c x. 1 it * i^r> 1 1 * u- u .u acid sequences in which the residues at each (relative) 

a sequence mouf for the HLA-DR molecule in which the . . M . . . t . , t . . „ > „ ' 

^ r , - . n . . tU position may be (a) restricted to a single residue, (b) allowed 

sets of amino acids define the allowed amino acids at the r J / ' . • . j . r -j / \ n j 

rL rj.u - to vary amongst a restricted set of residues, or (c) allowed 

corresponding , pos.fons of the mouf and then company ^ ^ g ^ ^ , his 

the amino acid sequence of the peptide to the sequence djsd £ b * wit £ u , limkj lnc inveDtion jn any 

motit Peptides which match , he motif have ^uch greater 30 • ^ ^ ^ * f 

likehhood of inducing the autoimmune disease. In addition, ^ ^ itio / wnich ^ restrjcted , 0 a 

if there is a known epitope implicated m the disease . he ^ be resen(ed b the one ., cUer abbre . 

method may further mclude selecting at least one ILK , c , . . . , . n . 

■I f L . -j .-r - . r • j viation for that residue, (b) a position which is allowed to 

contact residue of the epitope, identifying a set 01 ammo acid _ „ ■« . a k„ n 

... I. -™~n . * § • * j* vary amongst a set of residues will be represented by a 

residues which may serve as the TCR contact, and including 3, * * one . le «er abbreviations for those residues, 

this set in the motif at the appropnate position, n preferred ^ ^ fa to ^ aU 

embodiments, the motifs ' n ^>^^ amino acid residues will be represented by an "X." As an 

at positions corresponding to *t least the PI MHC binding on ^ ^ &( a 

pocket and at least one of the P4 and P6 pockets. P q ^ ^ ^ valine) ^.^^ 

In another embodiment of the invention, methods are 40 faoteudnCf methionine, or phenylalanine; thai the residue at 

provided specifically for identifying foreign antigens impli- ^ hion mus( ^ hislidinc; thal lhe residue at lhc 

cated in human autoimmune response. These methods ^ ilion ^ amino acid rcsidue; that the 

include the same steps as the previously described methods, a( ^ fourlh Uioa may ^ ^ OQe Qf ^ fesidues 

but further include a comparison of the resu tmg sequence vaH ^ isoleucine, methionine, phenylalanine, 

motif to sets of human pathogens. In preferred ine Qr fgy ^ op]mi and mal the at ^ fifih 

embodiments, peptide sequences from one or more species . (ion mu$t be Iysine Such a motif would ^ re p rese nted 

in lhe normal human intestinal Dora are excluded from by the following character string: 
consideration. In another preferred embodiment, sequences 

from one or more species of pathogen which is negatively v h X v k 

correlated with the incidence of the disease are excluded. Id 50 L L 

a most preferred embodiment, the human pathogen peptides I I 

arc searched and evaluated on a computer database using the ^ 

motif as a search criterion. y 

BRIEF DHSCRIFriON OP THE DRAWINGS 55 w 

FIG. 1 illustrates the pockets in the HLA-DR1 peptide 
binding site. The upper right shows a top view of the In one aspect of the present invention, sequence motifs are 
molecular surface (1.5 Angstrom probe radius) of the HLA- developed by analysis of the binding domains or binding 
DR1 peptide -binding site with the Ca trace of the HLA-DR 1 pockets of major histocompatibility complex HLA-DR pro- 
shown as thin lines. Pockets that accomodate the peptide 60 teins and/or the T cell receptor ("TCR") contact points of 
side chains of an antigen (in this case Influenza virus epitopes bound to MHC molecules. By providing a detailed 
hemagglutinin) are shown in detail in surrounding views, structural analysis of the HLA-DR residues involved in 
and are numbered as described herein. The PI pocket forming the MHC binding pockets, one is enabled to make 
accomodates the Tyr (308) of the antigen. The P4,P6,P7 and predictions of sequence motifs for binding to any of the 
P9 pockets bind antigen residues Gin (311), Thr (313), Leu 65 HLA-DR proteins. 

(314) and Leu (316) respectively. Antigen peptide side In another aspect of the present invention, sequence 

chains and the nearby peptide main chain are represented by motifs developed by the methods disclosed herein may be 
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used to identify self peptide epitopes involved in an autoim- 
mune response when the autoantigen is known or suspected. 

Id another aspect of the present invention, methods of 
identifying foreign peptide epitopes implicated in autoim- 
mune disease arc provided. I"hese methods involve the use 
of MHC and/or TCR binding motifs to identify peptides 
derived from certain classes of organisms or pathogens 
which may initiate human autoimmune response. In this 
aspect, the motifs may be developed according to the 
methods of the present invention or by other means known 
in the art. 

Using these sequence motifs as search, evaluation, or 
design criteria, one is enabled to identify classes of peptides 
which have a reasonable likelihood of binding to a particular 
MHC molecule and of interacting with a T cell receptor to 
induce T cell and/or autoimmune response. Use of these 
motifs, as opposed to pure sequence homology (which 
excludes many peptides which are anligenically similar but 
quite distinct in sequence) or sequence homology with 
unlimited "conservative" substitutions (which admits many 
peptides which differ at critical highly conserved sites), 
represents a significant advance in the ability of one of 
ordinary skill in the art to evaluate particular peptides for 
potential involvement in autoimmune disease and to search 
computer databases of peptide sequences to identify self and 
foreign peptides which may be involved in autoimmune 
response. In addition, the use of MHC and/or TCR binding 
motifs to search limited databases for foreign peptides which 
may be implicated in the aetiology of autoimmune disease is 
a novel application of the concept of binding motifs. 

Detailed examples of the practice of the present invention 
are presented below. The methods of the invention have now 
been used to identify the previously unknown self peptide 
epitopes involved in the autoimmune disease pemphigus 
vulgaris. In addition, the methods have been used to identify 
a class of foreign peptides which may be involved in the 
initiation of autoimmune response in multiple sclerosis (for 
which the self epitope had previously been identified). 

Thus, in another embodiment, the invention provides 
these peptides, in isolated form, which may be used in 
various diagnostic and therapeutic methods and products 
alluded to below. 

1. The MHC Class II HLA-DR Molecular Mimicry Motif 
The HLA-DR binding site is characterized by five major 
pockets which may bind the amino acid side chains of 
antigens (Stern el al., 1994, the entire disclosure of which is 
incorporated herein by reference). See FIG. 1. The amino 
acid residue of the antigen which binds in the first major 
pocket is designated PI . The remaining residues may then be 
numbered by their positions relative to PI (with positive _ ( 
numbers increasing toward the carboxy terminus and nega- 
tive numbers increasing toward the amino terminus): 

Pi ... P-l PI P2 P3 P4 ... Pj 

Thus, the first major pocket of an HLA-DR molecule, by 55 
definition, binds the side chain of residue PI on an antigen. 
The remaining major pockets bind residues P4, P6, P7 and 
P9. These residues are defined as the major MHC contact 
residues. 

The amino acid side chains of residues P-l, P2, P3, P5, 
P8, and Pll are oriented away from the HLA-DR binding 
site and, therefore, arc available as contact residues for a T 
cell receptor (TCR). All of these residues are defined as ICR 
contact residues. 

A. The MHC Contact Residues 

The first major pocket of the HLA-DR molecule is 
strongly hydrophobic. It is formed by a stretch of residues at 
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about positions 85, 86, 89 and 90 of the p chain, a stretch of 
residues at about positions 31, 32 and 34 of the a chain, and 
side chains from residues at about positions 7 and 43 of the 
a chain. For example, in HLA-DR 1 (DRA, DRB1*0101), 
5 the first pocket is formed by residues p85 (Val), p86 (Gly), 
P89 (Phe), p90 (Thr), a31 (He), a32 (Phe), a34 (Phe), al 
(He), and u43 (Trp). The corresponding residues for other 
HLA-DR alleles are known in the art (see, e.g., Marsh and 
Bodmcr, 1992, incorporated by reference herein) and arc 
10 available through genetic databases. 

Although most of the residues that shape the PI pocket are 
from the highly conserved DRa chain, the size and nature of 
this pocket varies due to polymorphisms in the p chain 
residues involved in the pocket. For the DRB1*0101 
15 protein, the pocket is large and hydrophobic and can accom- 
modate any of the aliphatic or aromatic residues. Polymor- 
phism at the p residues, however, may alter the binding 
capacity of the PI pocket. For example, the p86 residue is 
known to be polymorphic. Most commonly, this site is 
20 occupied by either Gly or Val. Generally, when Gly is 
present at P86 (as in DRB 1*0101 ), any of the aliphatic or 
aromatic residues may bind within the pocket. When Val is 
present, however, the pocket is smaller and Tyr and Trp 
cannot be accommodated. Thus, when p86 is Gly, position 
25 PI of the molecular mimicry motif may consist of residues 
chosen from V, L, I, A, M, F, Y, W and when P86 is Val, 
position PI of the motif may consist of residues chosen from 
V, L, I, A, M, F. Similar considerations apply to the other p 
residues of the Pi pocket. 
30 The P4 pocket of HLA-DR molecules is also a relatively 
large, shallow, hydrophobic pocket oriented across the anti- 
gen binding site. This pocket can bind a variety of large 
aliphatic side chains which can maintain hydrophobic inter- 
actions along the side and floor of the pocket. The pocket is 
35 formed by a stretch of residues at about positions 70, 71, 74 
and 78 of the p chain, and side chains from residues at about 
position 13 of the p chain and about position 9 of the a 
chain. For example, in HLA-DR 1 (DRA, DRB1*0101), the 
P4 pocket is formed by residues P70 (Gin), P71 (Arg), P74 
40 (Ala), P78 (Tyr), pi3 (Phe), and a9 (Gin). The correspond- 
ing residues for other HLA-DR alleles are known in the art 
(see, e.g., Marsh and Bodmer, 1992) and are available 
through genetic databases. 

Like the PI pocket, the P4 pocket is largely hydrophobic 
but its binding capacity is affected by polymorphisms at the 
p residues involved in the pocket. For example, different DR 
alleles have differently charged residues al position p71: In 
DRB1*0404, p71 is occupied by a positively charged Arg 
residue whereas in DRB1*0402 P71 is a negatively charged 
Glu residue. Thus, although this pocket can generally bind 
a variety of aliphatic or aromatic side chains (e.g., V, L, I, A, 
M, F, Y, W), positively charged P4 antigen residues are 
disfavored when P71 is positively charged and, similarly, 
negatively charged P4 residues are disfavored when p71 is 
also negative. Similar considerations apply to the other p 
residues of the P4 pocket. Note that some residues may be 
involved in forming each of two adjacent pockets (e.g., pl3 
in the P4 and P6 pockets) and therefore the occupancy of one 
of these pockets by a particular amino acid may influence the 
occupancy of the other. 

The P6 pocket of HLA-DR molecules is a relatively 
shallow pocket with a preference for smaller (e.g., A, G) P6 
antigen residues. The pocket is formed by the highly con- 
served all, a62, a65 and a66 residues and the highly 
polymorphic pll and pl3 residues of the HLA-DR protein. 
For example, in HLA-DR1 (DRA, DRB1*0101), the P6 
pocket is formed by residues al 1 (Glu), a62 (Asn), aG5 
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(Val), u66 (Asp), (511 (Leu) and pi3 (Phc). The correspond- 
ing residues for other HLA-DR alleles are known in the art 
(sec, e.g., Marsh and Bodmcr, 1992) and arc available 
through genetic databases. 

Although there are only two p chain residues in the P6 
pocket, they vary widely amongst the DR alleles. With a 
large Phe residue at pi3(as in DRB1*0101), the P6 residue 
is preferably one of the small residues (e.g., A, G). In other 
DR alleles, however, pl3 is occupied by smaller or more 
polar residues such as the (513 (His) of DRB1*0401. For 
such alleles, the P6 motif may include somewhat larger and 
polar residues (e.g., S, T, V) but should still avoid the largest 
and aromatic residues. Finally, in some alleles, pll and pl3 
are both serine residues (e.g., DRB 1*1101) and for these 
cases more hydrophilic or hydrogen booding residues may 
be included in the motif. 

The P7 pocket of HLA-DR molecules is also a relatively 
shallow pocket. The pocket is formed by five residues of the 
p chain: p28, P47, p61 f p67 and P71. For example, in 
HLA-DR 1 (DRA, DRB*0101), the P7 pocket is formed by 
residues 028 (Glu), p47 (tyr), p61 (Trp), P67 (Leu) and P71 
(Arg). The corresponding residues for other HLA-DR alleles 
are known in the art (see, e.g., Marsh and Bodmer, 1992) and 
are available through genetic databases. This pocket docs 
not appear to contribute greatly to the specificity of HLA- 
DR 1 but may be important in other alleles. 

The P9 pocket of HLA-DR molecules is generally a small 
hydrophobic pocket and, therefore, small hydrophobic resi- 
dues are preferred at the P9 position of the antigen. This 
pocket is formed by the conserved a chain residues a69, 
a72, a73 and a76 and by the polymorphic p chain residues 
P9 and p57. For example, in HLA-DR1 (DRA, 
DRB1*0101) the P9 pocket is formed by a69 (Aso), a72 
(lie), a73 (Mel), a76 (Arg), p9 (Trp) and P57 (Asp). The 
corresponding residues for other HLA-DR alleles are known 
in the art (sec, e.g., Marsh and Bodmcr, 1992) and arc 
available through genetic databases. 

The P6, P7 and P9 pockets appear to be less important 
than the PI and P4 pockets in binding to DR molecules but 
they may be more important in binding to other Lsotypes 
(e.g., the P9 pocket of DQ may be important). 

B. The TCR Contact Residues 

When there is no known or suspected antigen involved in 
an autoimmune response, the positions of the sequence 
motif corresponding to the TCR contact residues may be left 
unrestricted. That is, absent a known or suspected antigen, 
the TCR contact positions of the motif are preferably 
allowed to vary amongst all of the amino acids. 

When, on the other hand, there is a known or suspected 
antigen involved in an autoimmune response, at least some 
of the motif positions corresponding to the TCR contact 
residues may be restricted according to the sequence of the 
antigen. Thus, for example, the P2 and/or P3 and/or P5 
positions of the motif may be restricted to only those 
residues found at the corresponding positions of the antigen. 
Alternatively, at least some of the TCR contact residues of 
the motif may be restricted not just to the corresponding 
residues of the antigen but may be allowed to vary amongst 
similarly charged and/or structurally similar residues (e.g., 
K and R). It should be noted, however, that greater conser- 
vatism with respect to the TCR contact residues of the motif 
is justified by the presumably greater specificity of TCR 
binding relative to the known promiscuity of MHC binding. 

C Developing an HLA-DR Sequence Motif 

Given the present disclosure of the HLA-DR residues 
involved in the formation of the PI, P4, P6, P7 and P9 MHC 
binding pockets, and given the nucleotide or corresponding 
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amino acid sequence of any particular HLA-DR allele, one 
is now enabled to develop a sequence motif useful in 
evaluating or predicting the ability of peptides to bind to that 
MHC protein. When a particular antigen is known to (or is 
suspected of) binding to the MHC protein, the TCR contact 
residues of that antigen may also be considered in the motif. 

The method first requires the selection of two or more of 
the MHC binding pockets for which the choice of peptide 
residues will be restricted at the corresponding positions of 
the motif. One may select all five of the major binding 
pockets and develop a motif in which the corresponding five 
positions of the motif arc restricted or one may select fewer 
and develop a less restricted motif. As will be obvious to one 
of ordinary skill in the art, a more restricted motif will 
identify a lesser number of peptides in a database search and 
a less restricted motif will identify a greater number of 
peptides. In all instances, at least two of the major binding 
pockets should be selected. When fewer than all five MHC 
binding pockets are selected, it is preferred that at least one 
is PI and that a second is chosen from P4, P6 and P9. 

Either before or after the pockets to be restricted by the 
motif are selected, the set of amino acid side chains likely to 
bind within each of those pockets and, therefore, the set of 
amino acid residues that will define the corresponding 
positions of the motif, must be determined. This may be 
accomplished by one of ordinary skill in the art by consid- 
ering the amino acid residues which form the pocket. These 
residues, identified in Section A above, will determine the 
size and nature (i.e., hydrophobic, hydrophilic, positively 
charged, negatively charged, uncharged) of the pocket and 
consequently, the side chains which may bind within the 
pocket. Reference may be had to FIG. 1 during these 
considerations but will become increasingly unnecessary as 
one develops familiarity with the variations of the pockets. 

As a general matter, in light of the identification of the 
residues forming the MHC binding pockets of the HLA-DR 
proteins disclosed herein, one of ordinary skill in the art can 
easily develop a sequence binding motif for any HLA-DR 
protein for which these residues are known for two or more 
binding pockets. The major considerations are size, hydro- 
phobicity and charge. In light of the present disclosure, each 
of these considerations may be addressed according to 
well-known principles. A baseline is disclosed herein for 
each pocket for the DRB 1*0101 allele. Relative to this 
HLA-DR protein, one of ordinary skill is enabled to develop 
motifs for other HLA-DR alleles. Thus, substitutions which 
lead to larger/smaller pockets suggest that the corresponding 
motif positions should be restricted so as to permit smaller/ 
larger residues. Similarly, more/less hydrophobic pockets 
suggest that the corresponding motif positions should be 
restricted to more/less hydrophobic residues. Finally, 
positively/negatively charged pockets suggest that 
positively/negatively charged residues should be excluded 
and negatively/positively charged residues may be included 
at the corresponding motif positions. As noted above, the 
present disclosure enables one of ordinary skill to develop 
motifs based upon these well-established principles. 

For example, and not by means of limitation, consider the 
PI pocket of the HLA-DR protein. The residues forming this 
pocket in the DRB1*0101 were described above. For 
DRB1*0101, the PI pocket is large and hydrophobic and 
can accommodate any of the aliphatic or aromatic residues 
(e.g., V, L, 1, A, M, F, Y, W). For the DRB1* 1602 protein the 
same is true. On the other hand, in the DRB1M501 protein, 
the p86 position is occupied by Val instead of the Gly found 
in DRB1*0101 and DRBl'1602. This substitution 
decreases the size of the PI pocket in this MHC protein and, 
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as a result, the pocket cannot easily accommodate Tyr or Trp 
side chains. Thus, for DRB1M50I, the sequence motif at 
position PI may be restricted to residues chosen from V, L, 

I. A, M and F. 

Similarly, in light of the present disclosure, one of ordi- 5 
nary skill in the art may consider each of the MHC binding 
pockets, or only selected pockets, and develop a sequence 
motif for any HLA-DR protein for which the residues 
involved in pocket formation are known. ITiese residues will 
determine both the size and nature of the pocket and, JQ 
thereby, the size and nature of the residues which may bind 
within it. When the pocket is relatively small, the largest 
amino acid residues (e.g., Y, W) may be excluded from the 
corresponding position of the motif and, when the pocket is 
charged, amino acid residues of the same charge may be 
excluded. 15 

If a self or foreign epitope involved in immune response 
is known or suspected, and particularly if its TCR contact 
residues can be defined through the use of responsive T cell 
clones, the TCR contact residues of the epitope may also be 
considered in developing a sequence motif. As with the 20 
MHC contact residues, all or merely some of the TCR 
contact residues may be restricted in the motif. And, as with 
the MHC positions, the restriction of more positions (or the 
greater restriction of any one position) will result in the 
identification of fewer peptides in a database search. Unlike 25 
the MHC contact residues, for which at least two positions 
should be restricted in the motif, it is acceptable to omit any 
restrictions of TCR contact residues in the motif. 

If any TCR contact residue positions are restricted in the 
sequence motif, it is preferred that a position selected from 30 
positions P2, P3 and P5 be chosen. Because, in contrast to 
the relative promiscuity of MHC binding pockets, TCR 
contact residues appear to have greater specificity, it is 
preferred that any TCR contact residue positions which are 
restricted in the motif be rather narrowly restricted. That is, 35 
it is preferred that such positions be restricted to just the 
residue found at the corresponding position of the known 
antigen or just to residues which are highly similar in 
structure and charge. 

For example, and as further detailed below, the MBP 40 
(85-99) peptide is known to be the autoantigen implicated in 
multiple sclerosis. The P3 residue of this peptide is Phe 
(residue 91 of MBP) and the P5 residue is Lys (residue 93 
of MBP). Thus, if P3 is restricted in the motif, it is preferred 
that it be restricted to F or, possibly, to F and Y. Similarly, 45 
if P5 is restricted, it is preferred that it be restricted to only 
K or to K and the similarly charged R. Alternatively, P3 and 
P5 may be left unrestricted. 

Obviously, MHC and TCR positions not selected for 
restriction may be represented by, in the notation of this 50 
disclosure, an X. Similarly, as shown in the examples below, 
several motifs may he developed with varying numbers of 
positions restricted to varying extents. 

II. Identifying Self Epitopes Using Sequence Motifs 

An ever increasing number of autoimmune diseases are 55 
now being associated with particular alleles of the MIIC 
Class II HLA-DR locus. For most of these autoimmune 
diseases, the self epitope remains unknown. For some, 
however, a self protein involved in autoimmune response is 
known or suspected. 60 

In one aspect of the present invention, a method is 
provided for identifying the self epitopes involved in 
autoimmune diseases associated with HLA-DR alleles. 'Iliai 
is, by comparing human peptide sequences to the sequence 
motifs of the present invention, one is enabled to identify 65 
those peptides which have the highest likelihood of being 
the self epitopes involved in the disease. 



The method may be applied to any of the autoimmune 
diseases for which an association with a particular HLA-DR 
allele is known and for which the amino acid residues 
forming the MHC binding pockets (or at least two of the 
major pockets) for that allele are known. In accordance with 
the method discussed herein, one may then develop one or 
more sequence motifs for the HLA-DR protein associated 
with the disease. Naturally, if the disease is associated with 
two or more alleles, motifs may be developed for the two or 
more HLA-DR proteins and, in particular, a consensus motif 
may be developed using the residues shared by each position 
of these motifs. 

The sequence motif or motifs thus developed are then 
compared to appropriate sets of human peptide sequences. 
The human peptide sequences may include all known human 
sequences or may be limited in ways which will be obvious 
to one of ordinary skill in the art. For example, if the disease 
is restricted to particular tissues, the search may be limited 
to peptides found in those tissues. Conversely, peptides 
which are also found in unaffected tissues may be eliminated 
from the search pool. In the most extreme case, when the 
autoantigen is known or suspected but the particular epitope 
is unknown, the search may be limited to sequences within 
the autoantigen (see Example 1). 

This method may be used to identify a set of peptides 
which match the motif and which are most likely to be self 
epitopes. By varying the number of positions restricted by 
the motifs, and/or the extent of restriction at each position, 
and/or the size of the search pool, the number of peptides in 
the set will, in all likelihood, also be varied. As noted above, 
at least two of the MHC contact positions (e.g., Pi and P4) 
should be restricted. Depending upon the number of pep- 
tides in the resultant set, a more or less restrictive motif may 
then be employed to reduce or expand the set. The desired 
size of the resultant set depends, of course, upon the sub- 
sequent intentions of the practitioner of this method. 

Once a set of peptides has been identified, these peptides 
may optionally be screened for activity. The choice of such 
screens is at the discretion of the practitioner and beyond the 
scope of the present invention. Preferred screens, however, 
include in vitro tests for the ability to induce the proliferation 
of autoreactive T cells or to induce the secretion of lym- 
phokines (cytokines) from these T cells or to induce other 
effector functions such as cytotoxicity. In some 
circumstances, human in vivo tests may be appropriate and 
in other circumstances animal models of the human disease 
may be available. 

III. Identifying Foreign Epitopes Implicated in Human 
Autoimmune Disease 

As noted in the Background section, epidemiological 
evidence has suggested that various bacterial and viral 
pathogens may Ik implicated in human autoimmune dis- 
eases and the concept of molecular mimicry pervades the 
literature (reviewed by Oldstone, 1990). Prior attempts to 
identify particular foreign epitopes involved in human 
autoimmune disease, however, have depended upon direct 
sequence similarity to known human epitopes. The results 
have been disappointing and, to date, no pathogens or 
peptides derived from pathogens have been shown to be a 
primary cause of human autoimmune disease. 

Thus, in another aspect of the present invention, a method 
of identifying foreign epitopes implicated in human autoim- 
mune disease is provided. That is, for the first time a method 
of identifying such foreign epitopes is provided which 
employs sequence motifs to identify foreign peptides which 
have the highest likelihood of being involved in the aetiol- 
ogy of human autoimmune disease. 
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Hie method may be applied lo any of the auloimmune epitopes involved in the aetiology or in remissions of 

diseases for which an association with a particular MHC multiple sclerosis. As detailed in Example 2 below, each has 

protein is known and for which cither (1) a sequence motif been shown to be capable of inducing the proliferation of 

has been developed by prior art methods or (2) a sequence autoreactive T cell clones isolated from human multiple 

motif may be developed by the methods of the present 5 sclerosis patients. The sequence of the M BP (85-99) peptide 

invention. When the self epitope is known or suspected, is disclosed as SEQ ID NO.: 16. 

TCR contact residues may be included in the motif. As Each of these proteins has a variety of utilities and, 

before, one or more motifs may be employed and differently therefore, in another aspect, the present invention provides 

derived motifs may be combined to develop consensus each of these peptides in isolated form. In addition to the 

motifs. io sequences of fifteen residues shown in the Sequence Listing 

The sequence motif or motifs thus developed may then be and Tables, the present invention also embraces the f rag- 
compared to appropriate sets of peptide sequences derived mcnts of these peptides corresponding to the MHC binding 
from human pathogens. This is most conveniently accom- domain. In particular, the invention provides peptides cor- 
plished using genetic databases widely available to those of responding to the P-2 lo Pll, P-l to P9 and P-l to P6 
skill in the art. In a most preferred embodiment, the search 15 positions of each of SEQ ID NO.: 1 through SEQ ID NO.: 
pool is limited in one or more of the following ways: (1) only 15. As will be obvious to one of ordinary skill in the art, 
sequences from human bacterial or viral pathogens are however, any fragment of any of these peptides which 
included; (2) sequences from the normal human intestinal includes at least the PI and P4 or at least the PI and P6 or 
flora (e.g.,£. colioxo\hztEnterobacteriaceac) are excluded; at least the P4, P6 and P7 residues may have utility and is 
and (3) sequences from pathogens are included/excluded 20 intended to fall within the spirit and scope of the claims. In 
depending upon whether the geographical or epidemiologi- particular, longer peptides including these peptides or pep- 
cal incidence of the pathogens are positively/negatively tides including at least the MHC binding and TCR contact 
correlated with the incidence of the autoimmune disease in residues described above are contemplated as equivalents, 
question (see Example 2). The manner of production of these peptides is inconse- 

This method may be used lo identify a set of foreign 25 quential but they may be isolated and purified from their 

peptides which match the motif and which are most likely to natural sources or they may be synthesized. Because of their 

be involved in the human disease. As before, the number of relatively short length, it is presently contemplated that they 

peptides in the set can be varied by using more or less should be produced by synthesis. Methods of isolation, 

restrictive motifs and/or by varying the search pool. And, as purification and synthesis of such peptides are well known 

before, the resultant set of peptides may subsequently be 30 in the art and need not be recited herein, 

subjected to any of a variety of known screens for activity. The peptides of the present invention may be used for in 

IV. Self and Foreign Epitopes Identified by the Methods of vitro assays to aid in the diagnosis and classification of 

the Present Invention pemphigus vulgaris and multiple sclerosis. For example, 

As detailed in the examples below, the methods of the autoreactive T cells from patients with PV and MS may be 

present invention have been employed (1) to identify seven 35 tested, as in the examples below or by other known assays, 

self epitopes of the dcsraoglcin 3 protein implicated in for reactivity with these peptides. The ability or inability of 

pemphigus vulgaris and (2) lo identify eight foreign epitopes these peptides to cause proliferation of the T cells will, in the 

from human pathogens implicated in multiple sclerosis. case of pemphigus, allow for a refinement of the diagnosis 

Each of these peptides is fifteen residues in length, partly by particular desmoglein 3 epitopes and, in the case of 

as a result of the computer database search program used 40 multiple sclerosis, allow further classification of the disease 

(Genetics Computer Group program "Findpatterns") but by cross-reactive (self and foreign epitope) types. Immune 

also corresponding to the size of the cleft in MHC class II response to these peptides prior to onset of disease may also 

molecules. The fifth position of each corresponds to the PI be used as an indication of susceptibility or predisposition 

residue of the antigen. Thus, the P-2 to Pll residues which although care should be taken so as not to induce autoanti- 

span the MHC Class II binding cleft correspond to the third 45 genie response. 

through fifteenth residues of these sequences. The P-l to P9 The present invention also provides products and methods 

residues which are important to MHC and TCR binding using other peptides which may be identified by the methods 

correspond to the fourth through thirteenth positions. The of the present invention. These peptides, as well as those 

most important residues for MHC and TCR binding, P-l to disclosed above, may be used in each of the following 

P6 correspond to the fourth through tenth positions of these 50 embodiments. 

sequences. Th c peptides of the present invention may be used in the 

SEQ ID NO.: 1 through SEQ ID NO.: 7, also shown in development of animal models by immunizing animals (e.g., 

Table 1 as PVA. 1 through PVA. 7, correspond to residues mice, rabbits, non-human primates) with these peptides. 

78-93, 97-111, 190-204, 206-220, 251-265, 512-526 and Animals which not only develop a response to the peptide 

762-786 of the human desmoglein 3 protein. These peptides 55 but which also develop an auloimmune disease correspond- 

are implicated as self epitopes in pemphigus vulgaris. Lag to the human pathology will have obvious utility as 

Already, as described in Example 1, two of these peptides models for the human disease. Animals which develop a 

have been shown to cause proliferation of T cells isolated response to the peptide without developing any correspond- 

from two patients with pemphigus. ing autoimmune disease, will have utility as subjects for 

SEQ ID NO.: 8 through SEQ ID NO.: 15, also shown in 60 experiments involving the selective depletion of T cells or 

Table 2 are internal fragments of the herpes simplex virus other forms of desensitizalion or ionization. 

UL15 protein, adenovirus Type 12 ORF, Pseudomonas Importantly, these peptides and amino acid analogs of 

aeruginosa phosphomannomutase, human Papillomavious these peptides will have utility as therapeutic and diagnostic 

Type 7 12 protein, Epstein-Barr virus DNA polymerase, agents. The pathogens, viruses or bacteria from which they 

Influenza Type A hemagglutinin protein, Reovirus Type 3 65 arc derived will have utility as vaccinating agents. Some 

sigma 2 protein, and herpes simplex DNA polymerase, examples of the utility of these materials include the fol- 

respectively. These peptides are implicated as foreign lowing. 
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ITie peptide may be administered in high doses to produce 
high dose tolerance. This process of tolerizaiion is described 
in, for example, PCT patent application US93/08456 
(International publication number WO 94/06828). llius, in 
one set of embodiments, the invention provides for phar- 
maceutical preparations for use in tolerizing individuals to 
auloanligens. The preparations may include a pharmaceuli- 
cally acceptable carrier and an isolated human polypeptide 
which includes an amino acid sequence corresponding to a 
sequence motif for an HLA-DR protein which is associated 
with a human autoimmune disease. These polypeptides are 
capable of binding to the HLA-DR protein to form a 
complex which activates autoreactive T cells in subjects 
having the autoimmune disease. By using the peptides 
disclosed herein or identified by the methods of the 
invention, such pharmaceuticals can be used to combat 
autoimmune response. The use of such tolerizatioo to human 
autoimmune diseases is known in the art and need not be 
elaborated upon here. Tolerizing doses of collagen for 
rheumatoid arthritis and myelin basic protein for multiple 
sclerosis have been used. The present invention therefore 
specifically does not embrace these proteins. Other peptides, 
however, may now be identified by the present methods and 
be similarly used to treat autoimmune disease. 

In particular embodiments, such pharmaceutical prepara- 25 
tions are provided in which the HLA-DR protein is HLA- 
DR4 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, using PV motif #1, pharmaceuticals 
having peptides with this motif are provided. In most 
preferred embodiments, the pharmaceuticals include at least 30 
one of the polypeptides of SEQ ID NO.: 1 through SEQ ID 
NO.: 7. Thus, methods of tolerizing an individual to a 
pemphigus vulgaris autoantigen are also provided. 

In a similar set of embodiments, the invention provides 
for pharmaceutical preparations for use in tolerizing indi- 35 
viduals to antigens of human pathogens which arc impli- 
cated in human autoimmune disease. The preparations 
include a pharmaceutically acceptable carrier and an isolated 
human pathogen polypeptide which includes an amino acid 
sequence corresponding to a sequence motif for an HLA-DR 40 
protein which is associated with a human autoimmune 
disease. These polypeptides are capable of binding to the 
HLA-DR protein to form a complex which activates autore- 
active T cells in subjects having the autoimmune disease. 
Thus, by tolerizing an individual to these antigens, T cells 45 
which are cross-reactive with the self antigen will be ren- 
dered unresponsive or anergized and protection from the 
disease will be afforded. 

In particular embodiments, such pharmaceutical prepara- 
tions are provided in which the HLA-DR protein is HLA- 
DR2 protein and the autoimmune disease is multiple scle- 
rosis. In addition, using the the three MS motifs disclosed 
herein, pharmaceuticals having peptides with at least one of 
these motifs are provided. Specific embodiments of the 
pharmaceuticals include at least one of the polypeptides 
disclosed as SEQ ID NO.:8 through SEQ ID NO.: 15. Thus, 
methods of tolerizing an individual to a multiple sclerosis 
foreign antigen are also provided. 

In another set of embodiments, pharmaceuticals are pro- 
vided for vaccination against a human pathogen implicated 
in the aetiology of autoimmune disease. These pharmaceu- 
tical preparations include a pharmaceutically acceptable 
carrier and an immunogenic preparation effective to immu- 
nize against a human pathogen. The human pathogen is one 
which in its native form includes a polypeptide having an 
amino acid sequence corresponding to a sequence motif for 
an HIA-DR protein which Is associated with the autoim- 
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mune disease. ITiese polypeptides are capable of binding to 
the HLA-DR protein to form a complex which activates T 
cells which become autoreactive and intialc the autoimmune 
disease. ITie preparations of the present invention specifi- 
cally do not include such polypeptides but, rather, include 
other antigens from the pathogen. That is, a vaccine is 
produced which specifically does not include polypeptides 
which correspond to the sequence motifs for the HLA-DR 
protein and, if known, the TCR contacts of the self epitope. 
Because pathogens present a wide array of antigenic 
determinants, one may eliminate those which correspond to 
the relevant sequence motif and produce a vaccine which is 
effective against the pathogen but which will not include 
peptides implicated in the autoimmune response. 

Such vaccines, lacking peptides corresponding to the 
sequence motifs of the present invention, may be made in 
any convenient manner by one of ordinary skill in the art. 
For example, when producing an influenza vaccine, one may 
compare the peptide sequences of the influenza vims to 
sequence motifs developed according to the present inven- 
tion. The vaccine may then be made excluding the proteins 
which have the motif sequences (e.g. by using a fragment of 
the virus) or recombinant techniques may be used to produce 
a vims in which the sequences corresponding to the motif 
are altered such that they do not match the motif. In 
preferred embodiments, the altered residues are the TCR 
contact residues and, in particular, a substitution which alters 
the charge of a TCR contact residue is preferred. Similar 
vaccines can be developed for bacterial pathogens using, for 
example, only portions of the bacterium (e.g. bacterial 
surface proteins or membrane associated proteins) which 
lack a motif associated with human autoimmune disease or, 
again, by genetically altering the vaccine bacteria so as to 
change residues. 

The motifs which are considered in creating such vaccines 
may be chosen on any of several grounds. If the pathogen to 
which the vaccine is directed is associated with an autoim- 
mune disease, a motif may be developed according to the 
methods described herein for an HLA-DR protein associated 
with the disease. If there is also a known or suspected 
autoantigen, the motif may include TCR contact residues of 
the self epitope. The pathogen's protein complement may 
then be compared to the motif and peptides corresponding to 
the motif may be omitted from the vaccine or may be altered 
by recombinant means to produce a vaccine free of such 
peptides. Alternatively, vaccines may be developed with 
particular populations in mind. For individuals suffering 
from or at risk of developing a particular autoimmune 
disease, special vaccines may be developed. In this case, the 
motif is again chosen on the basis of the HLA-DR protein 
associated with the autoimmune disease and, when known, 
the TCR contact residues of the self epitope. 

In particular embodiments, such vaccine preparations are 
provided in which the HLA-DR protein is HLA-DR4 protein 
and the autoimmune disease is pemphigus vulgaris. And, in 
particular, vaccines are provided lacking a peptide corre- 
sponding to the PV motif #1 disclose~d herein. Specific 
embodiments of the vaccines include vaccines lacking at 
least one of the peptides disclosed as SEQ ID NO.: 1 through 
SEQ ID NO.: 7. Thus, methods of immunizing an individual 
against pathogens which may cause pemphigus vulgaris are 
also provided. 

Similarly, vaccine preparations are provided in which the 
HLA-DR protein is HIA-DR2 protein and the autoimmune 
disease is multiple sclerosis. And, in particular, vaccines 
lacking a peptide corresponding to at least one of the three 
MS motifs disclosed herein are provided. Specific embodi- 
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merits of the vaccines include vaccines lacking al least one 
of the peptides disclosed as SEQ ID NO.: 3 through SEQ ID 
NO.: 15. In particularly preferred embodiments, the vac- 
cines are developed from the pathogens listed in Table 2 but 
lack either the entire protein listed as the antigen or lack al 
least the peptides identified in the corresponding sequences. 
Thus, methods of immunizing an individual against patho- 
gens which may cause multiple sclerosis are also provided. 

These peptides will also be useful in assessing which 
pathogcn(s) may be important in a particular patient. For 
example, the T cells from one patient may proliferate in 
response to one or a few of these peptides, while those from 
another patient may proliferate in response to a different 
peptide or set of peptides. Analogs of the peptides may be 
synthesized in which one of the T cell receptor contact 
residues is substituted. For example, in the case of MS, 
peptides with the substitution of MBP91F by 91 A or of 
MBP93K by 93 A may be employed. Such analogs are not, 
however, limited to the substitution of these primary T cell 
receptor contact residues of to substitution by particular 
amino acids such as A. These peptide analogs may be used 
to anergize (inactivate) autoreactive T cells (see, for 
example, Sloan-Lancaster et al., 1993 and 1994) upon 
administration to autoimmune patients. The viruses or bac- 
terial pathogens may be useful in immunization by selecting 
viral or bacterial strains that do not carry the mimicry 
epitope. Proteins from these pathogens other than those that 
carry the mimicry epitope may also be selected for immu- 
nization. This therapy may be useful in preventing reinfec- 
tion and thus remissions of the disease or in preventing 
initial infections in particularly susceptible populations (the 
most obvious example of which is a disease- free identical 
twin of a patient). 

EXAMPLES 

1. Identification of Self Epitopes of Pemphigus Vulgaris 35 

As noted above, pemphigus vulgaris (PV) is, in different 
ethnic groups, associated either with a DR4 allele 
(DRB1*0402) or with a rare DQ1 allele (DQB1*05032); 
only a small fraction of PV patients have neither suscepti- 
bility gene (Ahmed et al., 1991; Ahmed et al., 1990; Scharf 40 
et al., 1988). The PV associated molecule has a negative 
charge (Glu) at the critical position (J71; the neighboring 
position ((370) is also negatively charged. The DR4 subtype 
associated with PV is the only one that carries a negative 
charge at DRp 71 (a positive charge (Arg) is found at DRp 45 
71 in the RA associated DR4 molecules). Although 
polymorphic, the P7 pocket residue DRp 67 (Leu/He) does 
not appear to be involved in peptide binding but probably 
acts as a TCR contact residue (Stern et al., 1994). 

The charge of a polymorphic residue at DRp 71 could 50 
therefore account for susceptibility to two different autoim- 
mune syndromes associated with structurally similar DR4 
subtypes: DR4 alleles associated with susceptibility to rheu- 
matoid arthritis have a positive charge at DR(5 71 (Arg) 
white the DR4 allele associated with pemphigus vulgaris has 55 
a negative charge at DRP 71 (Glu). Peptides selective for 
either DR4 molecule may therefore differ significantly in 
their charge at P4: Peptides with a negative charge at P4 
would be expected to bind to the RA associated molecules 
but not the pemphigus associated DR4 molecule; in contrast, 60 
a positive charge would be expected for the pemphigus 
peptides) at position 4. Due to the conserved nature of these 
molecules, other peptide anchor residues (PI and P6) would 
not be expected to be different for these DR4 subtypes. 

A sequence motif for selective binding to the HLA-DR 65 
DRB 1*0402 protein was developed according to the method 
disclosed herein. 



'I "he p chain residues involved in forming the PI pocket 
for this allele arc p85 (Val), p86 (Val), 089 (Phc) and p90 
(Thr). Thus, the presence of Val at p86 (instead of Gly as in 
DRB1*0101) suggests that the PI position of the motif be 
restricted to V, L, I, M and F. Alanine might also have been 
included but was not in this example. The P6 pocket is 
formed in part by the pil (Val) and P13 (His) of the 
DRB1*0402 protein. Relative to the DRB1*0101 allele, in 
which these residues are Leu and Phe respectively, the P6 
pocket of the DRB 1*0402 protein is somewhat larger and 
more polar. Thus, for the P6 position of the motif, S, T, N 
and V were allowed. Finally, the P4 pocket of this DR 
protein is formed in part by the residues pl3 (His), p70 
(Asp), P71 (Glu), P74 (Ala) and p78 (Tyr). As noted above, 
the two negatively charged residues as p70 and p71 create 
a preference for positively charged antigen residues and, 
therefore, the motif for the P4 position was restricted to K 
and R. 

Thus, the sequence motif for the pemphigus vulgaris 
autoantigen was defined as: 



Position 
PV Motif #1: 



P2 
X 



P3 
X 



P4 
K 
R 



Although the autoantigen for pemphigus vulgaris is 
known, the precise epitopes within the autoantigen have 
previously remained unknown. Using the method of the 
present invention, however, it has been possible to identify 
a small set of peptides which may serve as the autoanligenic 
determinants. The target antigen of pemphigus vulgaris is an 
epithelial adhesion molecule of the cadherin family, desmo- 
glein 3 (Amagai et al., 1991). Desmoglein 3 mediates Ca* + 
dependent adhesion between keratinocytes; the autoantibod- 
ies interfere with cell adhesion with resulting blister forma- 
tion (Takeichi, 1990). The autoantibodies are thought to be 
pathogenic since a transient blistering disease is also seen in 
newborns of affected mothers due to transfer of maternal 
immunoglobulin to the fetus. Transfer of serum or desmo- 
glein 3 specific antibodies to mice also results in acantholy- 
sis (Amagai et al., 1992). 

Only seven peptides from this large protein (130 kDa, 999 
amino acids) matched the motif. These seven pemphigus 
vulgaris antigens (PVA.l-PVA/7) are presented below in 
Table 1 with the residues corresponding to the MHC binding 
positions PI, P4 and P6 underlined. 

TABLE 1 



PVA.1 (res. 78-03) 
PVA-2 (res. 97-1 H) 
PVA.3 (res. 190-204) 
PVA.4 (res. 206-220) 
PVA.5 (res. 251-265) 
PVA.6 (res. 512-526) 
PVA.7 (rot. 762-786) 



ATQKJTYRISGVGID 
KilFWDjCNTGDINt 
LNSKtAFXiySQEPA 

tpmfu^rntgevrt 
cecnt kvkd vndnfp 
sajoInnkytgpytf ■> 
qsgtm rtr hstggtn 



Therefore, the selective presentation of one or several of 
these peptides by the PV associated DRB1*0402 molecule 
to T cells may be critical for initiating autoimmunity in PV. 
To test this, T cell lines were raised from blood mononuclear 
cells of two patients with active disease by stimulation with 
the seven candidate peptides. T cell lines were expanded 
with rIL-2 and tested for recognition of the candidate 
peptides in a proliferation assay. T cell lines from both 
patients recognized two peptides from the extracellular 



5,874,531 



19 



domain of desmoglein 3 (PVA.3 and PVA.4) that were 
located close to the major autoantibody recognition site. 
These T cell lines were HI-A-DR restricted as T cell pro- 
liferation was blocked by a monoclonal antibody specific for 
HLA-DR but not by a control antibody. These desmoglein 3 
peptides are therefore candidates for the T cell dependent 
induction of autoimmunity in pemphigus vulgaris. 

The molifs of the present invention relating to the HLA- 
DR proteins can be taken one step further. In a different 
ethnic group PV is associated with a rare DQ1 subtype 
(DQB 1*05032) that differs from the common DQ1 subtype 
only at position 57 of the DQfi chain (Sinha, et al, 1988). In 
the PV associated molecule DQp 57 is negatively charged 
(Asp) whereas in the common DQ1 subtype it is not. The 
same position on the DQp chain has also been implicated in 
susceptibility to diabetes. In diabetes, however, the reverse 
is true: DQ2 and DQ8 molecules associated with suscepti- 
bility to diabetes do not have a negative charge at DQ|i 57 
(Todd et al„ 1987). 

Based on these observations it becomes clear that two 
polymorphic positions in the MHC class II p chain (position 
71 of DRp and position 57 of DQp) are critical for selective 
peptide binding and the development of autoimmunity. 
Based on the criteria described above, a diabetes linked 
peptide would be expected to have a negative charge at P9 
since such a peptide would only bind to DQ molecules that 
do not have the same charge at DQp 57. In contrast, for the 
DQ1 associated cases of pemphigus, a peptide with a 
positive charge at P9 may be selective for the disease 
associated molecule which carries a negative charge at DQP 
57. In the case of DR4 linked autoimmunity, the charge at 
peptide position 4 confers selectivity to the disease associ- 
ated DR4 molecule: RA peptides have a negative charge at 
P4, PV peptides a positive charge at P4. Molifs for selective 
peptide binding may therefore prove to be tremendously 
useful in the identification of key epitopes that initiate 
human autoimmune diseases. This approach is expected to 
be useful not only for identifying peptides in PV, RA or 
diabetes but also for other autoimmune diseases where 
residues critical in peptide binding have beeu linked to 
disease susceptibility. 

2. Identification of Mimics of the Multiple Sclerosis MBP 
Autoantigen 

Susceptibility to MS is associated with HLA-DR2 (DRA, 
DRB 1*1501, the most common subtype of DR2) (Spielman 
et al. f 1982; Olerup el at, 1989). This MHC class II 
molecule is believed to play a critical role in the immuno- 
pathogencsis of MS by presenting immunodominant self 
peptides to autoreactive T cells. Following injection of MBP 
in experimental animals, T cells specific for immunodomi- 
nant peptides of MBP mediate an inflammatory response in 
the CNS that can be accompanied by marked demyeli nation 
(reviewed in Zamvil and Steinman, 1990). In previous 
studies two regions of human MBP were found to be 
immunodominant (residues 84-102 and 143-168) (Ota et al. 
1990; Pette et al. 1990; Martin el al. 1990; Wucherpfennig 
et al. 1994a). Reactivity to the MBP(84-102) peptide was 
predominantly seen in subjects carrying HI.A-DR2. Using L 
cell transfectants as antigen presenting cells, HLA-DR2b 
(DRA, DRB1*1501) was found to serve as the restriction 
element for these MBP(84-102) specific T cell clones. 

The MBP(84-102) peptide binds with high affinity to the 
HLA-DR2b molecule with two hydrophobic residues serv- 
ing as the anchors PI (Val 89 of MBP) and P4 (Phc 92 of 
MBP) (Wucherpfennig et al., 1994a; Vbgl et al. 1994). At 
position PI, Val 89 can be substituted by other aliphatic 
amino acids (Leu, lie), as well as by methionine and 
phenylalanine; alanine is tolerated at this position but 
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reduces the affinity of the peptide for HLA-DR2b. Al 
position P4, all aliphatic and aromatic residues arc permit- 
ted; again alanine is tolerated but results in a loss of binding 
affinity. 

A mutational analysis of putative TCR contact points 
confirms that P3 (Phe 91) and P5 (Lys 93) are TCR contacts 
for the MBP(85-99) specific clones; other residues such as 
P-l (Val 88) and P2 (His 90) are important for some clones 
but not for others. Substitution of P3 (Phe 91) by alanine 
abolishes TCR recognition for all clones; some clones 
tolerated conservative substitutions (eg. tyrosine or aliphatic 
amino acids) while other clones do not. Substitution of P5 
(Lys 93) by arginine Is tolerated by most T cell clones but 
more drastic changes frequently result in a partial or com- 
plete loss of T cell reactivity. This analysis demonstrates that 
P2 (His 90), P3 (Phe 91) and P5 (Lys 93) are the primary 
TCR contact residues for MBP while PI (Val 89) and P4 
(Phe 92) are the primary MHC contact residues for MBP. 
Thus analysis also confirms the present teaching that the 
TCR contact residues of a motif should be highly if not 
absolutely conserved. 

Based on this structural characterization of the immun- 
odominant MBP(85-99) peptide, three sequence motifs 
were developed to search a protein database for viral and 
bacterial peptides that matched these requirements. The 
motifs focused on the core region of the peptide, residues 
P-l to P5 (88-93 of the MBP protein), which contained the 
MHC and TCR contacts common to all clones. In the first 
motif, aliphatic amino acids were allowed at the first MHC 
anchor residue PI while both aliphatic and aromatic residues 
were permitted at the second MHC anchor P4. For the TCR 
contacts, the Phe 91 at P3 was absolutely conserved, the Lys 
93 at P5 could be substituted only by arginine, while the His 
90 at P2 and the Val 88 at P-l could be substituted by several 
structurally related amino acids. Thus, the first motif for 
multiple sclerosis antigens was defined as: 
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The second motif omitted the Val 88 P-l as a TCR contact 
residue (used only by some clones) and permitted also 
aromatic amino acids at the first MHC anchor PI (Val 89). 
This was done since the MBP(85-99) peptide is presented 
by different HLA-DR2 subtypes: Presentation by 
DRB1*1501 requires an aliphatic amino acid or phenylala- 
nine at this position while aliphatic and all aromatic residues 
at this position can serve as anchor for DRB1*1602. This 
difference, as explained above, relates to the size of the 
primary pocket for binding this hydrophobic residue and is 
determined by the Val/Gly dimorphism at DRp86 (Val in 
♦1501 and Gly in *1602) (Busch et al., 1991). Thus, the 
second motif for multiple sclerosis antigens was defined as: 
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-continued lhe MBP(85-99) peptide. Seven clones were tested with the 

viral/bacterial peptides selected according to the sequence 
motifs. 

Three of the seven clones tested were efficiently activated 

by several viral/bacterial peptides. The first clone (Hy. 1 B 1 1) 

was HLA-DQ1 restricted whereas the other two clones 

(Hy.2Ell and Hy.lGll) were HLA-DR2 restricted 

(Wucherpfennig el al., 1994a). Among the 70 peptides 

selected according to motifs #1 and #2, three mimicry 

The third sequence motif represented a modification of 10 peptides stimulated the DQ1 restricted clone and two pep- 

the TCR contact residues preferred by a subgroup of MBP tides stimulated both of the DR2 restricted T cell clones. 

(85-99) specific clones. For these clones, P5 (Lys 93) was Among the group of 59 peptides selected according to motif 

absolutely conserved while P3 (Phc 91) could be substituted #3, one peptide was identified for the DQ1 restricted clone 

by some aromatic or aliphatic amino acids. The third motif while two were identified for the DR2 restricted clones. 

for multiple sclerosis antigens was defined as: 15 Taken together, the DQ1 restricted T cell clone recognized 

five structurally related peptides: the immunodominant 

MBP(85-99) peptide, three viral peptides (from Herpes 

simplex, Adenovirus type 12 and Human Papillomavirus) 

and a bacterial peptide {Pseudomonas aeruginosa). Two of 

20 the DR2 restricted clones were activated by four peptides. 

Both clones recognized the MBP(85-99) peptide as well as 

peptides from EBV and Influenza virus. In addition, one 

clone recognized a viral peptide from Reovirus (clone 

Hy.2Ell) and one recognized a peptide from Herpes sim- 

„ . ->< plex virus (clone Hy.lGll). These results, and the sequences 

These HLA-DR motifs also matched well with the struc- 25 ^ ^ belQW m Tabk 2 

tural requirements for an HLA-DQl restricted clone specific r r 

for the MBP(85-99) peptide. This clone required the same 

minimal peptide segment as DR2 restricted clones (residues 

87-97). As in the DR2 restricted clones, P2 (His 90), P3 (Phe Peptides Recognized by Clone Hy.lBll (DQl Restricted): 

91) and P5 (Lys 93) appeared to be the primary TCR contact 30 

residues. Substitution of these hydrophobic positions bv MBP(85-W) enpvvhffknivtpr 
aspartic acid greatly diminished the stimulatory capacity of J«^r ™v™S 

the peptide While Substitutions by Other hydrophobic amino Pwdomanas aeruginosa, Phosphomanno- drixmlfakdwsrn 

acids were tolerated. These data suggest that the MBP mmase 

(85-99) peptide is bound in a Similar fashion to HLA-DR2b 35 Human Papillomavirus Type 7, 12 Protein IGGRVHFFKDISPIA 
and to HLA-DQl and that the same peptide residues arc Peptides Recognized by Clone Hy.2Ell (DR2 Restricted): 

critical for interaction with the TCR. MBP(85-99) ENPVVHFFKNIVTPR 

These motifs were used as search criteria in a search of Epstein-Barr Vims, dna Polymerase tggvyhfvkkhvhes 
protein databases (PIR and SwissProt) using the Genetics Influenza Type A, Hemagglutinin yrnlvwrkkntryp 
Computer Group .software (program: findpaUerns). More «o ^^X£St"o~ Hy^lTS^gSf 0 0 

than 600 sequences of viral and bacterial origin were iden- — c E 

lifted that matched these criteria. From this set, sequences mbp(85-99) entwhffknivtpr 
were selected based on the following criteria: (1) Viruses Epstein-Barr Vims, dna polymerase tggvyhfvkkhvhes 
known ,0 cause human pathology, (2) Viruses prevalent in 

the Northern Hemisphere where MS occurs most frequently, 45 
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(3) Selected bacterial sequences associated with inflamma- 
tory CNS disease (such as Borretia burgdorferi) and with For these viral/bacterial peptides to be involved in the 
invasive infections (such as Staphylococcus aureus, Kleb- initiation of autoimmunity, they have to be capable of potent 
siella pneumoniae and Pseudomonas aeruginosa). Not TccU stimulation that results in marked clonal expansion of 
included were most viruses that cause infections in tropical 50 autoaggressive T cell clones. The stimulatory capacity of 
countries, sequences derived from vaccinia virus and a large each of these peptides was therefore compared to the MBP 
number of sequences from E. coli (which is part of the (85-99) peptide in a titration experiment. The peptides were 
normal intestinal flora). When multiple antigenic variants found to be efficient stimulators of the MBP specific T cell 
were present, one or several sequences that best fit the motifs clones; in particular the EBV peptide (DR2 restricted 
were chosen. Tie selected peptides were synthesized by 55 clones) and the adenovirus peptide (DQl restricted clone) 
Pin-Technology on a 1 mg scale (Chiron Mimotopes, San were similar to the MBP(85-99) peptide in their stimulatory 
Diego). Seventy peptides were made which fit motifc #1 and capacity. These results demonstrate that foe T cell activation 
#2 and 59 peptides for motif #3, is not the result of a minor degree of 'cross-reactivity' but 
These peptides were then tested for their ability to activate rather the result of structural similarity sufiGcient for potent 
human MBP(85-99) specific T cell clones that had been 60 T cell activation. 

previously established from blood T cells of two patients Comparison of the peptide sequences that were found to 

with relapsing-remitting MS (Wucherpfennig et al., 1994a; stimulate the same TCR revealed several interesting points: 

Wucherpfennig et al., 1994b). Homozygous B cell lines that (1) Only one peptide (Human Papillomavirus L2 protein) 

expressed DR2 (DRB1M501 or DRB1M602) or DQl were had striking sequence similarity with the MBP(85-99) 

used as antigen presenting cells (APCs) in these T cell 65 peptide in that all amino acids in the MBP(89-95) 

proliferation experiments (Wucherpfennig et al., 1994a). As segment except position 94 (Asn to Asp) were identical 

a positive control, all clones were shown to be activated by (Table 2). For all other sequences, simple alignment 
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would not have predicted them lo be efficient stimula- 
tors of MBP(85-99) specific T cell clones. Therefore, 
absent the method of the present invention, these pep- 
tides would not have been identified. 

(2) At positions not specified by the search criteria, the 5 
selection for particular amino acids was still apparent 
(Table 2). For the DQl restricted clone, for example, 
aspartic acid was selected at position P6 (residue 94 of 
MBP), a probable TCR contact residue, in all four 
peptides. This position is occupied by asparagine in the 10 
MBP peptide (similar size, but no negative charge). 
Substitution of Asn 94 for Asp in the MBP peptide 
markedly increased its stimulatory capacity for the 
DQl restricted clone but reduced it for the DR2 
restricted clone. Selection also occurred at the neigh- 15 
boring P7 MHC contact (He 95) for which lie, Val or 
Phe were selected (all hydrophobic). 

(3) Different selection events occurred for the DQl and 
the DR2 restricted clones: At position P6 (94 of MBP), 
there was selection of aspartic acid (negative charge) 20 
for DQl peptides whereas there was selection for lysine 
(positive charge) in two of the three peptides presented 
by DR2. 

(4) In the flanking segments (residues 85-87 and 97-99), 
no apparent selection look place as amino acids with 25 
different size and charge were allowed. 

The majority of the viruses from which these peptides 
were identified are common human pathogens: Influenza 
Type A frequently causes respiratory tract infections; Human 
Papillomavirus infects epithelial tissues and has been linked 30 
to cervical carcinomas; and Epstein-Barr Virus (EBV) 
causes an acute viral syndrome (infectious mononucleosis) 
in young adults. Human Herpesvirus I (Herpes simplex), 
EBV and Human Papillomavirus cause latent or persistent 
infections with neurons (Herpes simplex), B cells (EBV) 35 
and epithelial cells (Papillomavirus) serving as the reser- 
voirs. Viral expression can be reactivated by UV exposure 
and stress (Herpes simplex) and by B cell activation (EBV) 
(Schwarz ct al, 1985; Epstein ct al., 1977; Spruance 1985; 
Tovey et al., 1 978). For the induction and maintenance of an 40 
autoimmune response these persistent viral infections are of 
particular interest as they could explain the chronicity of the 
clinical disease and the clonal expansion and persistence of 
MBP specific T cells. Reactivation of viral expression may 
also be involved in triggering clinical relapses. By this 45 
mechanism, viral peptides could activate resting MBP spe- 
cific T cells in periphery and allow them to invade the CNS. 

Are these foreign epitopes actually presented to autore- 
active T cells during a viral infection? The peptide from the 
EBV DNA polymerase allowed this question to be 50 
addressed. In EBV transformed B cells (which were used as 
antigen presenting cells in the T cell assays) the lytic viral 
cycle is repressed. The DNA polymerase gene is not tran- 
scribed in this latent stale; however, B cell activation results 
in activation of the lytic cycle and in the expression of the 55 
DNA polymerase gene (Datta el al., 1980). To examine 
MHC class II restricted presentation of the EBV DNA 
polymerase, an HLA-DR2* EBV transformed B cell line 
(MGAR) and an MIIC mismatched control (9001, IILA- 
DR1) were pretreated for 36 hours with phorbol ester which 60 
was removed by extensive washing prior to coculture of 
APC with T cells. T cell clones Uy.2Ell and Hy.lGll, 
which recognize the EBV DNA polymerase peptide pre- 
sented by HLA-DR2, were activated by an HLA-DR2* EBV 
transformed B cell line pretreated with phorbol ester. This 65 
effect was specific because MHC mismatched B cells did not 
activate the clones; also, a control clone (Ol>.lA12) that 



recognized MBP(85-99) but not the EBV peptide was not 
activated. In a separate experiment, T cell activation was 
blocked by a mAb specific for HLA-DR (mAb L243) but not 
by a mAb specific for HLA-DQ (G2a.5). 'I'hese results 
demonstrate that the MBP specific T cell clones recognize 
not only the viral peptide but also antigen presenting cells 
infected with the virus. In vivo this recognition event could 
lead to chronic antigenic stimulation of MBP specific T cells 
as B cell activation results in the expression of EBV genes, 
including the DNA polymerase gene. 

Finally, the presentation of the viral peptides by different 
DR2 subtypes was compared to determine if they arc 
efficiently presented by the disease associated molecule 
(DRB1*1501, the most common DR2 subtype). The MBP 
peptide was presented by three of the four DR2 subtypes; the 
peptide was not presented by DRB1*1601 which differs 
from DRB1*1602 by a single amino acid substitution (at 
position DRp67, a possible TCR contact). The two viral 
peptides were presented much belter by the DR15 molecules 
(DRB1*1501 and 1502 which differ only at position 
DR(J86) than by DRB 1*1602. This was particularly evident 
for the influenza peptide which only activated the T cell 
clone when presented by the DRB 1*1501/1502 molecules 
but not by DRB1*1602. These results indicate that the viral 
peptides identified herein are preferentially presented by the 
MS associated DR2 molecule (DRB1*1501). 

DEFINITIONS 
For clarity of interpretation and to clearly and distinctly 
point out the subject matter of the claimed invention, the 
following definitions are provided for several terms used in 
the claims appended hereto. 

The term "sequence motif," in accordance with the 
description provided herein, means a series of restrictions on 
the residues which may occupy certain relative positions of 
an amino acid sequence. A sequence motif must restrict at 
least three and preferably four or five positions of an amino 
acid sequence. The relative positions of the first (N-terminal) 
and last (C-terminal) restricted amino acid positions shall be 
separated by at least two but no more than twelve amino acid 
residues. For example, PI and P4 may be the first and last 
restricted residues and these residues are separated by two 
residues. As another example, P-l and Pll may be the first 
and last restricted residues and these are separated by ten 
residues. Positions between the first and last restricted 
positions may be restricted or unrestricted with the excep- 
tion that a total of at least three positions of the motif must 
be restricted. Of the three positions which must be restricted, 
at least two must be residues corresponding to major MHC 
binding pockets. If only two of the restricted residues 
correspond to MHC binding residues, the third must corre- 
spond to a TCR contact residue. Further, at least one of the 
positions restricted must correspond to either the PI or P4 
binding position. By "restricted" is meant that at least one, 
and preferably ten, amino acid residues shall be excluded 
from a position. 

An amino acid sequence "corresponds" to a sequence 
motif if it can be aligned with the positions of the sequence 
motif such that, at each restricted position of the motif, the 
amino acid sequence includes a residue which is not 
excluded from that position by the restrictions which define 
the motif. As the restrictions which define the motif are 
derived from the size and nature of the MHC binding 
pockets of HI-A-DR proteins and, optionally, the TCR 
contact residues of known epitopes, the restricted positions 
of the binding motif may also be said to correspond to the 
MHC binding pockets and TCR contact residues. 
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ITic term "isolated," in reference to a protein or 
polypeptide, means separated from its native or natural 
chemical microenvironment. Thus, a polypeptide isolated 
from a bacterium shall be in a preparation substantially free 
of most of the other bacterial polypeptides and, similarly, an 5 
isolated viral polypeptide preparation shall be substantially 
free of the other polypeptides which comprise the virus. 

The term "associated with," as used in connection with a 
particular HLA-DR protein and an autoimmune disease or 
autoimmune response, shall mean that the protein and the J0 
disease/response have been positively correlated by clinical 
or epidemiological studies such that the likelihood of devel- 
oping the disease/response has been shown to be increased 
by presence of the protein. 

Ilie term "HLA-DR protein" means the particular protein 
product of a particular allele of the MHC class II HLA-DR 
gene. A disease associated with an HIA-DR protein is one 
associated with such a particular protein and not merely with 
the HLA-DR gene locus. 

By the term "human pathogen" is meant a bacterium, 
virus or protozoan capable of infecting humans and gener- 
ating an immune response. The term specifically is intended 
to exclude the bacteria which form part of the normal human 
intestinal flora. By this term, "normal human intestinal 
flora," is meant the bacteria which normally inhabit the 
human gut, such as Escherichia coli, but which do not 
normally cause disease. 

By the term "autoreactive," as applied to T cells, is meant 
T cells from a human which are activated by a human self 
epitope. By "activation" of T cells is meant induction to 
proliferate, secrete lymphokines (cytokines) and/or to ini- 30 
tiate effector activity (e.g. cytotoxicity). 

By the term "autoantigen" is meant a self protein or 
polypeptide which includes a "self epitope." By "self 
epitope" is meant that part of an autoantigen which is 
recognized by T cells when bound to and presented by an 35 
MHC molecule. 

By the term "effective amount," with respect to tolerizing 
an individual to an antigen, is meant an amount of the 
antigen sufficient to render T cells, otherwise specific for the 
antigen, unresponsive to the antigen when bound to and 40 
presented by an MHC molecule. T cells which are unre- 
sponsive fail to activate when presented with the antigen for 
which they are specific. By the term "effective amount," 
with respect to immunizing an individual to an antigen, is 
meant an amount sufficient to induce an immune response 
which results in '1* cells specific for the antigen. Typical 
ranges of dosages arc from 1 nanogram/kilogram to 100 
milligrams/kilogram or even 500 milligrams/kilogram. 
Effective amounts will vary according to such factors as age, 
sex and sensitivity to the antigen. 

By the term "core MHC binding residues" is meant the 
residues of an epitope corresponding to the P-l to P9 
positions of a peptide bound to an HLA-DR molecule. By 
the term "inner core MHC binding residues" is meant those 
residues of an epitope corresponding to the P-l to P6 
positions of a peptide bound to an IILA-DR molecule. 
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StQUUNCH LISTING 



( I ) OENERAI. INFORMATION: 

(ill) NUMBER OF SEQUENCES: 16 



( 2 ) INFORMATION FOR SEQ ID NO: I: 
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( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 amino acids 
{ B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOIECULE TYPE: pcplidc 

( i i i ) HYPOTKETICA I ^ NO 

( v ) FRAGMENT TYPE: inlcmal 

t v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: I: 

Ala Tbr Gin Lys Me Tht 
1 5 



( 2 ) INFORMATION FOR SEQ ID NO:2: 

( i ) SEQUENCE CHARACTERISTICS: 
t A ) LENGTH: 15 amino acids 
( R ) TYPE; ara'mo acid 
( D ) TOPOLOGY: I bear 

( i i ) MOLECULE TYPE peptide 

( i i i ) HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Phe Gly Me Phe Vi I Val Asp Lys Asn Thi CI y Asp Me Asn Mc 
1 5 10 15 



( 2 ) INFORMATION FOR SEQ ID NO:3: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 15 amino acid* 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i i I > HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: inlcmal 

( v i ) ORIGINAL SOURCE: 

< A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Leu Asn Ser L y * Me Ala Phe Lys Mc Val Ser Gin Glu Pro Ala 
1 5 10 15 



( 2 ) INFORMATION FOR SEQ ID NO:4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 amino acids 
C B ) TYPE- amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

(til) HYPOTHETICAL NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE 

( A ) ORGANISM: HOMO SAPIENS 



T y r A r g Me 



S e i 
1 0 



Gly Val Gly Me 



Asp 
1 5 
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( x i ) SEQUENCE DESCRIPTION: SKQ ID NO:4: 

Tht Pio Met P h e Leu Leu Set A i <: Asn Thi Gly G 1 u V a I Aig Thr 
1 5 10 15 

( 2 ) INFORMATION FOR SEQ ID NO: 5; 

( i ) SEQUENCE CHARACTERISTICS: 

{ A ) IENGTH: 15 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

< v ) FRAGMENT TYPE; internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

C,» G I o C y * Asn Mc I. v s Val L v s Asp Val Asn Asp Asn Phe Pro 
1 " 5 10 15 

( 2 ) INFORMATION FOR SEQ ID NO:6: 

( i ) SEQUENCE CHA RACIER 1ST! CS: 
( A ) LENGTH: 15 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:6; 

S«r Ala Arg Thr Leu Asn Asn Arg Tyr Thr Gly Pro Tyr Thr Phe 
1 5 10 15 

( 2 ) INFORMATION FOR SEQ E> NO:7: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

{ i i i ) HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Gin S«r Gly Thr Mel Arg Thr Arg His Ser Thr Gly Gly Thr Asn 
I 5 10 15 

( 2 ) INFORMATION FOR SEQ ID NO:& 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) I-ENGTH: 15 amino acids 
( B ) TYPE: amino acid 
( D > TOPOLOGY: linear 
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( i i ) MOl JiCULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HERPES SIMPLEX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Phe Arg Gin Leu Val His Phc Val Arg Asp Phe Ala Gin Leu L«u 
i > 10 IS 

( 2 ) INFORMATION FOR SEQ ID NO:9: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 15 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

{ v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

I A ) ORGANISM: ADENOVIRUS TYPE 12 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:*): 

Asp Phc Gl u Val Val Tbi Phe Leu Lys Asp Val Leu Pro G I o Pbe 

I 5 10 IS 

( 2 ) INFORMATION FOR SEQ ID NO:10: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH; 15 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOI.ECI :i£ TYPE: peptide 

( i i i ) irVPOTHETICAl-: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

t A ) ORGANISM: PSEL'DOMONAS AERUGINOSA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l0t 

Asp Arg Leu Leo Mcl Leu Phc Ala Lys Asp Val Val Ser Arg Asn 
1 5 10 15 

( 2 ) INFORMATION FOR SEQ ID NO:11: 

( i )SFX}lJF>TECrURACrFJUSXICS: 
( A ) LENGTH: 15 amino acids 
( 13 ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

MM ) HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HUMAN PAPILLOMAVIRUS TYPE 7 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

Mc CI y Gl y Arg Val His Phe Phe Lys Asp Me Ser Pro Me Ala 
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( 2 ) INFORMATION FOR SEQ ID NO: 12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 1 5 iraino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOIECUIETYPE: peptide 

( i i i ) HYPOTHETICAI.: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: EPSTEIN BARR VIRUS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Thr Gly C I y Vt 1 Tyi His Phe Vi I Lys Lys His Va I His CI i Ser 
1 5 ' 10 15 



( 2 ) INFORMATION FOR SEQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 amino acids 
( B ) TYPE: arainrt acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: INFLUENZA VIRUS TYPE A 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tvr Arg Asa Leu Vat Trp 

1 ' 5 



( 2 ) INFORMATION FOR SEQ ID NO:14: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 15 amino acid* 
( B ) TYPE: amino acid 
( D ) TOKOLOGY: lines i 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL NO 

( v ) FRAGMENT TYPE: internal 

( v I ) ORIGINAL SOURCE: 

( A ) ORGANISM: REO VIRUS TYPE 3 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ala Arg All All Pbc Leu Phe Lys Thr Val Gly Phe Gly Gly 
1 5 10 15 



( 2 ) INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 amino acid* 
( B ) TYPE: amino avid 
t D ) TOPOLOGY: linear 

( i i ) MOIECULE TYPE: peptide 

( ■ i i ) HYPOTHETICAL; NO 

( v ) FRAGMENT TYPE: internal 



L y 



L y s 
I 0 



Asn Thr A r £ Tyr 



P r o 
1 5 



37 



5,874,531 

-continued 



38 



{ v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HERPES SIMPLEX VIRUS 

( x i ) SEQUENCE DESCRIPTION: SEQ IDNO:15: 

Gly Gly Afg Aig Leu Pbe Plic Vj I Ly* Ala Hi* V» I Aig CI d Sef 
1 5 10 15 



( 2 ) INFORMATION FOR SEQ ID NO:16: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IS amino acids 
( B ) TYPE: amino acid 
( U ) TOPOLOGY: Imcar 

( i j ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL; NO 

( v ) FRAGMENT TYPE: internal 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: HOMO SAPIENS 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.16: 

Gin Axn Pro Val Vj I Hit Phe Phc L y * A * n Me V»l Thr Pro Arg 
1 5 10 15 



We claim: 

1. An isolated polypeptide consisting essentially of an 
amino acid sequence selected from the group consisting of 
SEQ ID NO.: 1, SEQ ID NO.: 2, SEQ ID NO.: 3, SEQ ID 
NO.: 4, SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ IDNO.: 
7. 

2. An isolated polypeptide consisting essentially of an 
amino acid sequence selected from the group consisting of 
SEQ ID NO.: 8, SEQ ID NO.: 9, SEQ ID NO.: 10, SEQ ID 



NO.: 11, SEQ ID NO.: 12, SEQ ID NO.: 13, SEQ ID NO.: 
14, and SEQ ID NO.: 15. 

3. A composition comprising a pharmaceutically accept- 
able carrier and a polypeptide of claim 1. 

4. A composition comprising a pharmaceutically accept- 
able carrier and a polypeptide of claim 2, 

***** 
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116 Section II tYMfHOOU Specificity and Activation 



T lymphocytes play a central role in specific im- 
mune responses to protein antigens. In Chapter 5, we 
introduced the concept that the physical forms of anti- 
gens recognized by T cells are actually peptide frag- 
ments that are derived from protein antigens and are 
bound to cell surface proteins encoded by genes of the 
major histocompatibility complex (MHC). Peptides 
bound to class 1 MHC molecules are typically derived 
from proteins synthesized in the cell that displays them 
(i.e., "endogenous antigens") and are recognized by 
CD8 + T cells, which are usually cytolytic T lymphocytes 
(CTLs). CTLs provide a major host defense mechanism 
against intracellular microbes. In contrast, peptides de- 
rived from proteins in the extracellular environment 
("exogenous antigens") are displayed in association 
with class II MHC molecules and are recognized by 
CD4 + T cells, which are usually helper T lymphocytes. 
Helper T cells are required for the induction of humoral 
and cell-mediated responses, which are most effective 
in eliminating extracellular pathogens. In this chapter, 
we will describe in greater detail the characteristics of 
the peptide-MHC molecule complexes, the nature of the 
cells that form and display these complexes, and the 
mechanisms by which cells convert endogenous and 
exogenous protein antigens to peptides that can bind to 
MHC molecules. The activation and effector mecha- 
nisms of T cell subsets that occur subsequent to recog- 
nition of peptide-MHC complexes will be discussed in 
later chapters. 

Characteristics of Antigen 
Recognition by T 
Lymphocytes 

Our current understanding of T cell antigen recog- 
nition is the culmination of a vast amount of work that 
began with studies of the physicochemical forms of 
antigens that stimulated cell-mediated immunity. 
These studies led to the discovery that cells other than 
T lymphocytes play an obligatory role in T cell activa- 



tion by foreign antigens, and later to the elucidation < 
the function of MHC molecules in T cell antigen reco; 
nition. 

The Forms of Antigens 
Recognized by T Lymphocytes 

The specificity of T lymphocytes for complexes < 
peptides and MHC molecules determines several cha: 
acteristics of T cell antigen recognition, which differ i 
fundamental ways from antigen recognition by ant 
body molecules. 

1 . T lymphocytes recognize only peptides, where* 
B cells can specifically recognize peptides, protein 
nucleic acids, polysaccharides, lipids, and small chem 
cals. As a result, T cell-mediated immune response 
are induced only by protein antigens (the natur 
source of foreign peptides), whereas humoral immur, 
responses are seen with protein and non-protein ant 
gens. Some T cells are specific for chemically reacth 
forms of haptens such as dinitrophenol. In these situ, 
tions, it is likely that the haptens bind to cell surfac 
proteins, including MHC molecules, and peptides cot 
taining these hapten conjugates are recognized by 
cells. 

2. T cells recognize only linear determinants < 
peptides defined predominantly by primary amino aci 
sequences that assume extended conformations withi 
the peptide-binding clefts of MHC molecules. In coi 
trast, B cells specific for protein antigens may recogni? 
conformational determinants that exist when proteir 
are in their native tertiary (folded) configuration or d< 
terminants that are exposed by denaturation or pr< 
teolysis. Thus, when an animal is immunized with 
native protein, the antigen-specific T cells that are stin 
ulated will respond to denatured or even proteolyt 
cally digested forms of that protein. In contrast, ant 
bodies produced by B cells after immunization with th 
native protein react only with the native protein (Tab! 



TABLE 6-1. Qualitative Differences io Antigen Recognition 
by T and B Lymphocytes 



Secondary Immone Response 



TCtU-Mediated 

Immunizing Secondary B Cttt -Mediated (Delayed -Type) 

Antigen Antigen Exposure (Antibody Production) Hypertensitivity 



Native protein Native protein + + 

Denatured protein Native protein - + 

Native protein Denatured protein - + 

Denatured protein Denatured protein + + 



Antigen recognition by T and B lymphocytes is qualitatively different In an immunized 
animal, B cefls are specific for amfbnnational determinants of the innminogen and, 
therefore, distinguish between native and denatured protein antigens. T cefls, however, do 
not distinguish between native and denatured protein antigens because T cefls recognize 
linear epitopes on short peptides derived from the intact proteins by proteolysis. 
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6-1). Consistent with this difference in the nature of 
antigenic determinants for T and B cell recognition is 
the finding that T cell responses to a soluble antigen 
cannot be inhibited using antibodies specific for con- 
formational determinants of that antigen, whereas anti- 
gen recognition by B cells can be competitively inhib- 
ited by such antibodies. 

3. T cells recognize and respond to foreign peptide 
antigens only when the antigen is attached to the sur- 
faces of other cells, whereas B cells and secreted anti- 
bodies bind soluble antigens in body fluids or cell sur- 
face antigens. This is because MHC molecules form part 
of the complex that T cells recognize, and these mole- 
cules are cell surface-bound integral membrane pro- 
teins. The display of peptide-MHC complexes in a form 
that can be recognized by T cells is called antigen 
presentation. Cells that display antigens in this form 
are called antigen-presenting cells (APCs), The 
properties and functions of APCs are discussed later in 
the chapter. Although historically the term APC has 
most often been used to describe cells that present 
antigen to CD4 + helper T lymphocytes, it is now appro- 
priate to describe target cells of CTL lysis as APCs as 
well, since CTLs also recognize peptide-MHC com- 
plexes on the surface of these target cells. 



Physicochemical Features of 
Peptide-MHC Complexes 

In order to stimulate T ceil responses, peptides 
derived from protein antigens must bind to MHC mole- 
cules and must remain stably bound long enough to 
allow specific T cells to engage the complex. We will 
first describe the characteristics of the peptide-MHC 
complexes that T cells recognize, and later we will de- 
scribe how these complexes are formed. The structural 
basis for peptide binding to both class I and class II 
MHC molecules has been analyzed by three ap- 
proaches: (1) binding assays of peptides to purified 
MHC molecules in cell-free solutions; (2) x-ray crystal- 
lography analyses of purified MHC molecules with 
bound peptides; and (3) amino acid analysis of peptides 
eluted from MHC molecules purified from cell mem- 
branes. The following are the main features of MHC- 
peptide interactions elucidated by these approaches: 

1. The association of antigenic peptides and MHC 
molecules is a saturable, low-affinity interaction (K d = 
10' 6 M) with a slow "on rate" and a very slow "off rate. " 
These features were determined first by the techniques 
of equilibrium dialysis (see Chapter 3 and Fig. 6-1) and 



fiGURE 6-1. Demonstration of 
peptide binding to MHC molecules 
by equilibrium dialysis. Purified I -A" 
class ft MHC molecules bind a pep- 
tide fragment of hen egg lysozyme 
(HEL), HEU46-61), that is labeled 
with a fluorescent marker (NBD), 
which allows the concentration of 
the peptide to be determined (A). 
Analysis of the amount of HEL 
bound (i.e.. the concentration 
within the dialysis membrane mi- 
nus the concentration outside) at 
varying concentrations of peptide 
shows that the binding is saturable 

(B) . Scatchard analysis shotvs that 
each M* molecule has approxi- 
mately one binding site (X-axis in- 
tercept), and the dissociation con- 
stant (K d ) is approximately i x 
/0 -6 M (calculated from the slope 
of the line). In this plot, r repre- 
sents the number of peptide mole- 
cules bound to each MHC molecule 
when there is an excess of peptide 

(C) . Note that HEL(46-61) is 
known to be presented in associa- 
tion with l-A k molecules. (Modified 
with permission from Babbitt. B. 
P, P. M. Allen, C. Matsueda, E. Ha- 
ber, and E. R Unanue. Binding of 
immunogenic peptides to la histo- 
compatibility molecules. Nature 
3 1 7359-361 . 1985. Copyright © 
1985, Macmillan Magazines Ltd) 
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